Scholars@Duke publication: Multiple testing under dependence via graphical models

Multiple testing under dependence via graphical models

Publication , Journal Article

Liu, J; Zhang, C; Page, D

Published in: Annals of Applied Statistics

September 1, 2016

Large-scale multiple testing tasks often exhibit dependence. Leveraging the dependence between individual tests is still one challenging and important problem in statistics. With recent advances in graphical models, it is feasible to use them to capture the dependence among multiple hypotheses. We propose a multiple testing procedure which is based on a Markov-random-field-coupled mixture model. The underlying true states of hypotheses are represented by a latent binary Markov random field, and the observed test statistics appear as the coupled mixture variables. The model can be learned by a novel EM algorithm. The next step is to infer the posterior probability that each hypothesis is null (termed local index of significance), and the false discovery rate can be controlled accordingly. We also provide a semi-parametric variation of the graphical model which is useful in the situation where f1 (the density function of the test statistic under the alternative hypothesis) is heterogeneous among multiple hypotheses. This semiparametric approach exactly generalizes the local FDR procedure [J. Amer. Statist. Assoc. 96 (2001) 1151–1160] and connects with the BH procedure [J. Roy. Statist. Soc. Ser. B 57 (1995) 289–300]. Simulations show that the numerical performance of multiple testing can be improved substantially by using our procedure. We apply the procedure to a real-world genome-wide association study on breast cancer, and we identify several SNPs with strong association evidence.

Duke Scholars

Author David Page Biostatistics & Bioinformatics, Division of Biostatistics

Published In

Annals of Applied Statistics

DOI

10.1214/16-AOAS956

EISSN

1941-7330

ISSN

1932-6157

Publication Date

September 1, 2016

Volume

Issue

Start / End Page

1699 / 1724

Related Subject Headings

Statistics & Probability
4905 Statistics
1403 Econometrics
0104 Statistics

Citation

APA

Chicago

ICMJE

MLA

NLM

Liu, J., Zhang, C., & Page, D. (2016). Multiple testing under dependence via graphical models. Annals of Applied Statistics, 10(3), 1699–1724. https://doi.org/10.1214/16-AOAS956

Liu, J., C. Zhang, and D. Page. “Multiple testing under dependence via graphical models.” Annals of Applied Statistics 10, no. 3 (September 1, 2016): 1699–1724. https://doi.org/10.1214/16-AOAS956.

Liu J, Zhang C, Page D. Multiple testing under dependence via graphical models. Annals of Applied Statistics. 2016 Sep 1;10(3):1699–724.

Liu, J., et al. “Multiple testing under dependence via graphical models.” Annals of Applied Statistics, vol. 10, no. 3, Sept. 2016, pp. 1699–724. Scopus, doi:10.1214/16-AOAS956.

Liu J, Zhang C, Page D. Multiple testing under dependence via graphical models. Annals of Applied Statistics. 2016 Sep 1;10(3):1699–1724.

Published In

Annals of Applied Statistics

DOI

10.1214/16-AOAS956

EISSN

1941-7330

ISSN

1932-6157

Publication Date

September 1, 2016

Volume

Issue

Start / End Page

1699 / 1724

Related Subject Headings

Statistics & Probability
4905 Statistics
1403 Econometrics
0104 Statistics