Scholars@Duke publication: Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime

Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime

Publication , Journal Article

Agazzi, A; Lu, J

Duke Scholars

Author Jianfeng Lu Mathematics

Citation

APA

Chicago

ICMJE

MLA

NLM

Agazzi, A., & Lu, J. (n.d.). Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime.

Agazzi, Andrea, and Jianfeng Lu. “Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime,” n.d.

Agazzi A, Lu J. Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime.

Agazzi, Andrea, and Jianfeng Lu. Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime.

Agazzi A, Lu J. Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime.