Scholars@Duke publication: A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE

A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE

Publication , Journal Article

Zhou, M; Lu, J

Published in: SIAM Journal on Control and Optimization

January 1, 2025

We consider policy gradient methods for stochastic optimal control problems in continuous time. In particular, we analyze the gradient flow for the control, viewed as a continuous time limit of the policy gradient method. We prove the global convergence of the gradient flow and establish a convergence rate under some regularity assumptions. The main novelty in the analysis is the notion of local optimal control function, which is introduced to characterize the local optimality of the iterate.

Duke Scholars

Author Jianfeng Lu Mathematics

Published In

SIAM Journal on Control and Optimization

DOI

10.1137/23M1570739

ISSN

0363-0129

Publication Date

January 1, 2025

Volume

Issue

Start / End Page

2605 / 2631

Related Subject Headings

Industrial Engineering & Automation
5108 Quantum physics
4901 Applied mathematics
0913 Mechanical Engineering
0906 Electrical and Electronic Engineering
0102 Applied Mathematics

Citation

APA

Chicago

ICMJE

MLA

NLM

Zhou, M., & Lu, J. (2025). A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE. SIAM Journal on Control and Optimization, 63(4), 2605–2631. https://doi.org/10.1137/23M1570739

Zhou, M., and J. Lu. “A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE.” SIAM Journal on Control and Optimization 63, no. 4 (January 1, 2025): 2605–31. https://doi.org/10.1137/23M1570739.

Zhou M, Lu J. A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE. SIAM Journal on Control and Optimization. 2025 Jan 1;63(4):2605–31.

Zhou, M., and J. Lu. “A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE.” SIAM Journal on Control and Optimization, vol. 63, no. 4, Jan. 2025, pp. 2605–31. Scopus, doi:10.1137/23M1570739.

Zhou M, Lu J. A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE. SIAM Journal on Control and Optimization. 2025 Jan 1;63(4):2605–2631.

Published In

SIAM Journal on Control and Optimization

DOI

10.1137/23M1570739

ISSN

0363-0129

Publication Date

January 1, 2025

Volume

Issue

Start / End Page

2605 / 2631

Related Subject Headings

Industrial Engineering & Automation
5108 Quantum physics
4901 Applied mathematics
0913 Mechanical Engineering
0906 Electrical and Electronic Engineering
0102 Applied Mathematics