Skip to main content

Classification logit two-sample testing by neural networks for differentiating near manifold densities.

Publication ,  Journal Article
Cheng, X; Cloninger, A
Published in: IEEE transactions on information theory
October 2022

The recent success of generative adversarial networks and variational learning suggests that training a classification network may work well in addressing the classical two-sample problem, which asks to differentiate two densities given finite samples from each one. Network-based methods have the computational advantage that the algorithm scales to large datasets. This paper considers using the classification logit function, which is provided by a trained classification neural network and evaluated on the testing set split of the two datasets, to compute a two-sample statistic. To analyze the approximation and estimation error of the logit function to differentiate near-manifold densities, we introduce a new result of near-manifold integral approximation by neural networks. We then show that the logit function provably differentiates two sub-exponential densities given that the network is sufficiently parametrized, and for on or near manifold densities, the needed network complexity is reduced to only scale with the intrinsic dimensionality. In experiments, the network logit test demonstrates better performance than previous network-based tests using classification accuracy, and also compares favorably to certain kernel maximum mean discrepancy tests on synthetic datasets and hand-written digit datasets.

Duke Scholars

Published In

IEEE transactions on information theory

DOI

ISSN

0018-9448

Publication Date

October 2022

Volume

68

Issue

10

Start / End Page

6631 / 6662

Related Subject Headings

  • Networking & Telecommunications
  • 4613 Theory of computation
  • 4006 Communications engineering
  • 1005 Communications Technologies
  • 0906 Electrical and Electronic Engineering
  • 0801 Artificial Intelligence and Image Processing
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Cheng, X., & Cloninger, A. (2022). Classification logit two-sample testing by neural networks for differentiating near manifold densities. IEEE Transactions on Information Theory, 68(10), 6631–6662. https://doi.org/10.1109/tit.2022.3175691
Cheng, Xiuyuan, and Alexander Cloninger. “Classification logit two-sample testing by neural networks for differentiating near manifold densities.IEEE Transactions on Information Theory 68, no. 10 (October 2022): 6631–62. https://doi.org/10.1109/tit.2022.3175691.
Cheng X, Cloninger A. Classification logit two-sample testing by neural networks for differentiating near manifold densities. IEEE transactions on information theory. 2022 Oct;68(10):6631–62.
Cheng, Xiuyuan, and Alexander Cloninger. “Classification logit two-sample testing by neural networks for differentiating near manifold densities.IEEE Transactions on Information Theory, vol. 68, no. 10, Oct. 2022, pp. 6631–62. Epmc, doi:10.1109/tit.2022.3175691.
Cheng X, Cloninger A. Classification logit two-sample testing by neural networks for differentiating near manifold densities. IEEE transactions on information theory. 2022 Oct;68(10):6631–6662.

Published In

IEEE transactions on information theory

DOI

ISSN

0018-9448

Publication Date

October 2022

Volume

68

Issue

10

Start / End Page

6631 / 6662

Related Subject Headings

  • Networking & Telecommunications
  • 4613 Theory of computation
  • 4006 Communications engineering
  • 1005 Communications Technologies
  • 0906 Electrical and Electronic Engineering
  • 0801 Artificial Intelligence and Image Processing