Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime.

Journal Article

Cited Authors

  • Agazzi, A; Lu, J

Published Date

  • 2020

Published In

  • CoRR

Volume / Issue

  • abs/2010.11858