fastMitoCalc: an ultra-fast program to estimate mitochondrial DNA copy number from whole-genome sequences
Publication
, Journal Article
Qian, Y; Butler, TJ; Opsahl-Ong, K; Giroux, NS; Sidore, C; Nagaraja, R; Cucca, F; Ferrucci, L; Abecasis, GR; Schlessinger, D; Ding, J
Published in: Bioinformatics
Mitochondrial DNA (mtDNA) copy number is tightly regulated in tissues, and is both a critical determinant of mitochondrial function and a potential biomarker for disease. We and other groups have shown that the mtDNA copy number per cell can be directly estimated from whole-genome sequencing. The computation is based on the rationale that sequencing coverage should be proportional to the underlying DNA copy number for autosomal and mitochondrial DNA, and most computing time is spent calculating the average autosomal DNA coverage across ∼3 billion bases. That makes analyzing tens of thousands of available samples very slow. Here we present fastMitoCalc, which takes advantage of the indexing of sequencing alignment files and uses a randomly selected small subset (0.1%) of the nuclear genome to estimate autosomal DNA coverage accurately. It is more than 100 times faster than current programs. fastMitoCalc also provides an option to estimate copy number using a single autosomal chromosome, which could also achieve high accuracy but is slower. Using fastMitoCalc, it becomes much more feasible now to conduct analyses on large-scale consortium data to test for association of mtDNA copy number with quantitative traits or nuclear variants.