Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime.

Journal Article

Full Text

Duke Authors

Cited Authors

  • Agazzi, A; Lu, J

Published Date

  • 2021

Published In

  • Iclr

Published By