Controlling taxa abundance improves metatranscriptomics differential analysis.
BACKGROUND: A common task in analyzing metatranscriptomics data is to identify microbial metabolic pathways with differential RNA abundances across multiple sample groups. With information from paired metagenomics data, some differential methods control for either DNA or taxa abundances to address their strong correlation with RNA abundance. However, it remains unknown if both factors need to be controlled for simultaneously. RESULTS: We discovered that when either DNA or taxa abundance is controlled for, RNA abundance still has a strong partial correlation with the other factor. In both simulation studies and a real data analysis, we demonstrated that controlling for both DNA and taxa abundances leads to superior performance compared to only controlling for one factor. CONCLUSIONS: To fully address the confounding effects in analyzing metatranscriptomics data, both DNA and taxa abundances need to be controlled for in the differential analysis.
Duke Scholars
Altmetric Attention Stats
Dimensions Citation Stats
Published In
DOI
EISSN
Publication Date
Volume
Issue
Start / End Page
Location
Related Subject Headings
- RNA
- Microbiology
- Metagenomics
- Computer Simulation
- 3207 Medical microbiology
- 3107 Microbiology
- 11 Medical and Health Sciences
- 07 Agricultural and Veterinary Sciences
- 06 Biological Sciences
Citation
Published In
DOI
EISSN
Publication Date
Volume
Issue
Start / End Page
Location
Related Subject Headings
- RNA
- Microbiology
- Metagenomics
- Computer Simulation
- 3207 Medical microbiology
- 3107 Microbiology
- 11 Medical and Health Sciences
- 07 Agricultural and Veterinary Sciences
- 06 Biological Sciences