A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE
Publication
, Journal Article
Zhou, M; Lu, J
Published in: SIAM Journal on Control and Optimization
January 1, 2025
We consider policy gradient methods for stochastic optimal control problems in continuous time. In particular, we analyze the gradient flow for the control, viewed as a continuous time limit of the policy gradient method. We prove the global convergence of the gradient flow and establish a convergence rate under some regularity assumptions. The main novelty in the analysis is the notion of local optimal control function, which is introduced to characterize the local optimality of the iterate.
Duke Scholars
Published In
SIAM Journal on Control and Optimization
DOI
ISSN
0363-0129
Publication Date
January 1, 2025
Volume
63
Issue
4
Start / End Page
2605 / 2631
Related Subject Headings
- Industrial Engineering & Automation
- 5108 Quantum physics
- 4901 Applied mathematics
- 0913 Mechanical Engineering
- 0906 Electrical and Electronic Engineering
- 0102 Applied Mathematics
Citation
APA
Chicago
ICMJE
MLA
NLM
Zhou, M., & Lu, J. (2025). A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE. SIAM Journal on Control and Optimization, 63(4), 2605–2631. https://doi.org/10.1137/23M1570739
Zhou, M., and J. Lu. “A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE.” SIAM Journal on Control and Optimization 63, no. 4 (January 1, 2025): 2605–31. https://doi.org/10.1137/23M1570739.
Zhou M, Lu J. A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE. SIAM Journal on Control and Optimization. 2025 Jan 1;63(4):2605–31.
Zhou, M., and J. Lu. “A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE.” SIAM Journal on Control and Optimization, vol. 63, no. 4, Jan. 2025, pp. 2605–31. Scopus, doi:10.1137/23M1570739.
Zhou M, Lu J. A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE. SIAM Journal on Control and Optimization. 2025 Jan 1;63(4):2605–2631.
Published In
SIAM Journal on Control and Optimization
DOI
ISSN
0363-0129
Publication Date
January 1, 2025
Volume
63
Issue
4
Start / End Page
2605 / 2631
Related Subject Headings
- Industrial Engineering & Automation
- 5108 Quantum physics
- 4901 Applied mathematics
- 0913 Mechanical Engineering
- 0906 Electrical and Electronic Engineering
- 0102 Applied Mathematics