Skip to main content

Protein prediction for trait mapping in diverse populations.

Publication ,  Journal Article
Schubert, R; Geoffroy, E; Gregga, I; Mulford, AJ; Aguet, F; Ardlie, K; Gerszten, R; Clish, C; Van Den Berg, D; Taylor, KD; Durda, P; Guo, X ...
Published in: PLoS One
2022

Genetically regulated gene expression has helped elucidate the biological mechanisms underlying complex traits. Improved high-throughput technology allows similar interrogation of the genetically regulated proteome for understanding complex trait mechanisms. Here, we used the Trans-omics for Precision Medicine (TOPMed) Multi-omics pilot study, which comprises data from Multi-Ethnic Study of Atherosclerosis (MESA), to optimize genetic predictors of the plasma proteome for genetically regulated proteome-wide association studies (PWAS) in diverse populations. We built predictive models for protein abundances using data collected in TOPMed MESA, for which we have measured 1,305 proteins by a SOMAscan assay. We compared predictive models built via elastic net regression to models integrating posterior inclusion probabilities estimated by fine-mapping SNPs prior to elastic net. In order to investigate the transferability of predictive models across ancestries, we built protein prediction models in all four of the TOPMed MESA populations, African American (n = 183), Chinese (n = 71), European (n = 416), and Hispanic/Latino (n = 301), as well as in all populations combined. As expected, fine-mapping produced more significant protein prediction models, especially in African ancestries populations, potentially increasing opportunity for discovery. When we tested our TOPMed MESA models in the independent European INTERVAL study, fine-mapping improved cross-ancestries prediction for some proteins. Using GWAS summary statistics from the Population Architecture using Genomics and Epidemiology (PAGE) study, which comprises ∼50,000 Hispanic/Latinos, African Americans, Asians, Native Hawaiians, and Native Americans, we applied S-PrediXcan to perform PWAS for 28 complex traits. The most protein-trait associations were discovered, colocalized, and replicated in large independent GWAS using proteome prediction model training populations with similar ancestries to PAGE. At current training population sample sizes, performance between baseline and fine-mapped protein prediction models in PWAS was similar, highlighting the utility of elastic net. Our predictive models in diverse populations are publicly available for use in proteome mapping methods at https://doi.org/10.5281/zenodo.4837327.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

PLoS One

DOI

EISSN

1932-6203

Publication Date

2022

Volume

17

Issue

2

Start / End Page

e0264341

Location

United States

Related Subject Headings

  • Quantitative Trait Loci
  • Proteome
  • Proteins
  • Polymorphism, Single Nucleotide
  • Pilot Projects
  • Models, Genetic
  • Male
  • Humans
  • Genetic Association Studies
  • General Science & Technology
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Schubert, R., Geoffroy, E., Gregga, I., Mulford, A. J., Aguet, F., Ardlie, K., … Wheeler, H. E. (2022). Protein prediction for trait mapping in diverse populations. PLoS One, 17(2), e0264341. https://doi.org/10.1371/journal.pone.0264341
Schubert, Ryan, Elyse Geoffroy, Isabelle Gregga, Ashley J. Mulford, Francois Aguet, Kristin Ardlie, Robert Gerszten, et al. “Protein prediction for trait mapping in diverse populations.PLoS One 17, no. 2 (2022): e0264341. https://doi.org/10.1371/journal.pone.0264341.
Schubert R, Geoffroy E, Gregga I, Mulford AJ, Aguet F, Ardlie K, et al. Protein prediction for trait mapping in diverse populations. PLoS One. 2022;17(2):e0264341.
Schubert, Ryan, et al. “Protein prediction for trait mapping in diverse populations.PLoS One, vol. 17, no. 2, 2022, p. e0264341. Pubmed, doi:10.1371/journal.pone.0264341.
Schubert R, Geoffroy E, Gregga I, Mulford AJ, Aguet F, Ardlie K, Gerszten R, Clish C, Van Den Berg D, Taylor KD, Durda P, Johnson WC, Cornell E, Guo X, Liu Y, Tracy R, Conomos M, Blackwell T, Papanicolaou G, Lappalainen T, Mikhaylova AV, Thornton TA, Cho MH, Gignoux CR, Lange L, Lange E, Rich SS, Rotter JI, NHLBI TOPMed Consortium, Manichaikul A, Im HK, Wheeler HE. Protein prediction for trait mapping in diverse populations. PLoS One. 2022;17(2):e0264341.

Published In

PLoS One

DOI

EISSN

1932-6203

Publication Date

2022

Volume

17

Issue

2

Start / End Page

e0264341

Location

United States

Related Subject Headings

  • Quantitative Trait Loci
  • Proteome
  • Proteins
  • Polymorphism, Single Nucleotide
  • Pilot Projects
  • Models, Genetic
  • Male
  • Humans
  • Genetic Association Studies
  • General Science & Technology