Skip to main content

A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE

Publication ,  Journal Article
Zhou, M; Lu, J
Published in: SIAM Journal on Control and Optimization
January 1, 2025

We consider policy gradient methods for stochastic optimal control problems in continuous time. In particular, we analyze the gradient flow for the control, viewed as a continuous time limit of the policy gradient method. We prove the global convergence of the gradient flow and establish a convergence rate under some regularity assumptions. The main novelty in the analysis is the notion of local optimal control function, which is introduced to characterize the local optimality of the iterate.

Duke Scholars

Published In

SIAM Journal on Control and Optimization

DOI

ISSN

0363-0129

Publication Date

January 1, 2025

Volume

63

Issue

4

Start / End Page

2605 / 2631

Related Subject Headings

  • Industrial Engineering & Automation
  • 5108 Quantum physics
  • 4901 Applied mathematics
  • 0913 Mechanical Engineering
  • 0906 Electrical and Electronic Engineering
  • 0102 Applied Mathematics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Zhou, M., & Lu, J. (2025). A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE. SIAM Journal on Control and Optimization, 63(4), 2605–2631. https://doi.org/10.1137/23M1570739
Zhou, M., and J. Lu. “A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE.” SIAM Journal on Control and Optimization 63, no. 4 (January 1, 2025): 2605–31. https://doi.org/10.1137/23M1570739.
Zhou M, Lu J. A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE. SIAM Journal on Control and Optimization. 2025 Jan 1;63(4):2605–31.
Zhou, M., and J. Lu. “A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE.” SIAM Journal on Control and Optimization, vol. 63, no. 4, Jan. 2025, pp. 2605–31. Scopus, doi:10.1137/23M1570739.
Zhou M, Lu J. A POLICY GRADIENT FRAMEWORK FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS WITH GLOBAL CONVERGENCE GUARANTEE. SIAM Journal on Control and Optimization. 2025 Jan 1;63(4):2605–2631.

Published In

SIAM Journal on Control and Optimization

DOI

ISSN

0363-0129

Publication Date

January 1, 2025

Volume

63

Issue

4

Start / End Page

2605 / 2631

Related Subject Headings

  • Industrial Engineering & Automation
  • 5108 Quantum physics
  • 4901 Applied mathematics
  • 0913 Mechanical Engineering
  • 0906 Electrical and Electronic Engineering
  • 0102 Applied Mathematics