Skip to main content

Mean-Field Analysis for Learning Subspace-Sparse Polynomials with Gaussian Input

Publication ,  Conference
Chen, Z; Ge, R
Published in: Advances in Neural Information Processing Systems
January 1, 2024

In this work, we study the mean-field flow for learning subspace-sparse polynomials using stochastic gradient descent and two-layer neural networks, where the input distribution is standard Gaussian and the output only depends on the projection of the input onto a low-dimensional subspace. We establish a necessary condition for SGD-learnability, involving both the characteristics of the target function and the expressiveness of the activation function. In addition, we prove that the condition is almost sufficient, in the sense that a condition slightly stronger than the necessary condition can guarantee the exponential decay of the loss functional to zero.

Duke Scholars

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2024

Volume

37

Related Subject Headings

  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 1701 Psychology
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Chen, Z., & Ge, R. (2024). Mean-Field Analysis for Learning Subspace-Sparse Polynomials with Gaussian Input. In Advances in Neural Information Processing Systems (Vol. 37).
Chen, Z., and R. Ge. “Mean-Field Analysis for Learning Subspace-Sparse Polynomials with Gaussian Input.” In Advances in Neural Information Processing Systems, Vol. 37, 2024.
Chen Z, Ge R. Mean-Field Analysis for Learning Subspace-Sparse Polynomials with Gaussian Input. In: Advances in Neural Information Processing Systems. 2024.
Chen, Z., and R. Ge. “Mean-Field Analysis for Learning Subspace-Sparse Polynomials with Gaussian Input.” Advances in Neural Information Processing Systems, vol. 37, 2024.
Chen Z, Ge R. Mean-Field Analysis for Learning Subspace-Sparse Polynomials with Gaussian Input. Advances in Neural Information Processing Systems. 2024.

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2024

Volume

37

Related Subject Headings

  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 1701 Psychology