Skip to main content

Kernel two-sample tests for manifold data

Publication ,  Journal Article
Cheng, X; Xie, Y
Published in: Bernoulli
November 1, 2024

We present a study of a kernel-based two-sample test statistic related to the Maximum Mean Discrepancy (MMD) in the manifold data setting, assuming that high-dimensional observations are close to a low-dimensional manifold. We characterize the test level and power in relation to the kernel bandwidth, the number of samples, and the intrinsic dimensionality of the manifold. Specifically, when data densities p and q are supported on a d-dimensional sub-manifold M embedded in an m-dimensional space and are Hölder with order β (up to 2) on M, we prove a guarantee of the test power for finite sample size n that exceeds a threshold depending on d, β, and Δ2 the squared L2-divergence between p and q on the manifold, and with a properly chosen kernel bandwidth γ. For small density departures, we show that with large n they can be detected by the kernel test when Δ2 is greater than n−2β/(d+4β) up to a certain constant and γ scales as n−1/(d+4β). The analysis extends to cases where the manifold has a boundary and the data samples contain high-dimensional additive noise. Our results indicate that the kernel two-sample test has no curse-of-dimensionality when the data lie on or near a low-dimensional manifold. We validate our theory and the properties of the kernel test for manifold data through a series of numerical experiments.

Duke Scholars

Published In

Bernoulli

DOI

ISSN

1350-7265

Publication Date

November 1, 2024

Volume

30

Issue

4

Start / End Page

2572 / 2597

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 1403 Econometrics
  • 0104 Statistics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Cheng, X., & Xie, Y. (2024). Kernel two-sample tests for manifold data. Bernoulli, 30(4), 2572–2597. https://doi.org/10.3150/23-BEJ1685
Cheng, X., and Y. Xie. “Kernel two-sample tests for manifold data.” Bernoulli 30, no. 4 (November 1, 2024): 2572–97. https://doi.org/10.3150/23-BEJ1685.
Cheng X, Xie Y. Kernel two-sample tests for manifold data. Bernoulli. 2024 Nov 1;30(4):2572–97.
Cheng, X., and Y. Xie. “Kernel two-sample tests for manifold data.” Bernoulli, vol. 30, no. 4, Nov. 2024, pp. 2572–97. Scopus, doi:10.3150/23-BEJ1685.
Cheng X, Xie Y. Kernel two-sample tests for manifold data. Bernoulli. 2024 Nov 1;30(4):2572–2597.

Published In

Bernoulli

DOI

ISSN

1350-7265

Publication Date

November 1, 2024

Volume

30

Issue

4

Start / End Page

2572 / 2597

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 1403 Econometrics
  • 0104 Statistics