Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime
Publication
, Journal Article
Agazzi, A; Lu, J
Duke Scholars
Citation
APA
Chicago
ICMJE
MLA
NLM
Agazzi, A., & Lu, J. (n.d.). Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime.
Agazzi, Andrea, and Jianfeng Lu. “Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime,” n.d.
Agazzi, Andrea, and Jianfeng Lu. Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime.