A phylogenetic transform enhances analysis of compositional microbiota data.
Surveys of microbial communities (microbiota), typically measured as relative abundance of species, have illustrated the importance of these communities in human health and disease. Yet, statistical artifacts commonly plague the analysis of relative abundance data. Here, we introduce the PhILR transform, which incorporates microbial evolutionary models with the isometric log-ratio transform to allow off-the-shelf statistical tools to be safely applied to microbiota surveys. We demonstrate that analyses of community-level structure can be applied to PhILR transformed data with performance on benchmarks rivaling or surpassing standard tools. Additionally, by decomposing distance in the PhILR transformed space, we identified neighboring clades that may have adapted to distinct human body sites. Decomposing variance revealed that covariation of bacterial clades within human body sites increases with phylogenetic relatedness. Together, these findings illustrate how the PhILR transform combines statistical and phylogenetic models to overcome compositional data challenges and enable evolutionary insights relevant to microbial communities.
Duke Scholars
Altmetric Attention Stats
Dimensions Citation Stats
Published In
DOI
EISSN
Publication Date
Volume
Location
Related Subject Headings
- Microbiota
- Humans
- Computational Biology
- Biostatistics
- 42 Health sciences
- 32 Biomedical and clinical sciences
- 31 Biological sciences
- 0601 Biochemistry and Cell Biology
Citation
Published In
DOI
EISSN
Publication Date
Volume
Location
Related Subject Headings
- Microbiota
- Humans
- Computational Biology
- Biostatistics
- 42 Health sciences
- 32 Biomedical and clinical sciences
- 31 Biological sciences
- 0601 Biochemistry and Cell Biology