Skip to main content

Neural Stein Critics with Staged L2-Regularization

Publication ,  Journal Article
Repasky, M; Cheng, X; Xie, Y
Published in: IEEE Transactions on Information Theory
November 1, 2023

Learning to differentiate model distributions from observed data is a fundamental problem in statistics and machine learning, and high-dimensional data remains a challenging setting for such problems. Metrics that quantify the disparity in probability distributions, such as the Stein discrepancy, play an important role in high-dimensional statistical testing. In this paper, we investigate the role of L2 regularization in training a neural network Stein critic so as to distinguish between data sampled from an unknown probability distribution and a nominal model distribution. Making a connection to the Neural Tangent Kernel (NTK) theory, we develop a novel staging procedure for the weight of regularization over training time, which leverages the advantages of highly-regularized training at early times. Theoretically, we prove the approximation of the training dynamic by the kernel optimization, namely the 'lazy training', when the L2 regularization weight is large, and training on n samples converge at a rate of O(n-1/2) up to a log factor. The result guarantees learning the optimal critic assuming sufficient alignment with the leading eigen-modes of the zero-time NTK. The benefit of the staged L2 regularization is demonstrated on simulated high dimensional data and an application to evaluating generative models of image data.

Duke Scholars

Published In

IEEE Transactions on Information Theory

DOI

EISSN

1557-9654

ISSN

0018-9448

Publication Date

November 1, 2023

Volume

69

Issue

11

Start / End Page

7246 / 7275

Related Subject Headings

  • Networking & Telecommunications
  • 4613 Theory of computation
  • 4006 Communications engineering
  • 1005 Communications Technologies
  • 0906 Electrical and Electronic Engineering
  • 0801 Artificial Intelligence and Image Processing
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Repasky, M., Cheng, X., & Xie, Y. (2023). Neural Stein Critics with Staged L2-Regularization. IEEE Transactions on Information Theory, 69(11), 7246–7275. https://doi.org/10.1109/TIT.2023.3299258
Repasky, M., X. Cheng, and Y. Xie. “Neural Stein Critics with Staged L2-Regularization.” IEEE Transactions on Information Theory 69, no. 11 (November 1, 2023): 7246–75. https://doi.org/10.1109/TIT.2023.3299258.
Repasky M, Cheng X, Xie Y. Neural Stein Critics with Staged L2-Regularization. IEEE Transactions on Information Theory. 2023 Nov 1;69(11):7246–75.
Repasky, M., et al. “Neural Stein Critics with Staged L2-Regularization.” IEEE Transactions on Information Theory, vol. 69, no. 11, Nov. 2023, pp. 7246–75. Scopus, doi:10.1109/TIT.2023.3299258.
Repasky M, Cheng X, Xie Y. Neural Stein Critics with Staged L2-Regularization. IEEE Transactions on Information Theory. 2023 Nov 1;69(11):7246–7275.

Published In

IEEE Transactions on Information Theory

DOI

EISSN

1557-9654

ISSN

0018-9448

Publication Date

November 1, 2023

Volume

69

Issue

11

Start / End Page

7246 / 7275

Related Subject Headings

  • Networking & Telecommunications
  • 4613 Theory of computation
  • 4006 Communications engineering
  • 1005 Communications Technologies
  • 0906 Electrical and Electronic Engineering
  • 0801 Artificial Intelligence and Image Processing