Comparative analyses of seven algorithms for copy number variant identification from single nucleotide polymorphism arrays.

Journal Article

Determination of copy number variants (CNVs) inferred in genome wide single nucleotide polymorphism arrays has shown increasing utility in genetic variant disease associations. Several CNV detection methods are available, but differences in CNV call thresholds and characteristics exist. We evaluated the relative performance of seven methods: circular binary segmentation, CNVFinder, cnvPartition, gain and loss of DNA, Nexus algorithms, PennCNV and QuantiSNP. Tested data included real and simulated Illumina HumHap 550 data from the Singapore cohort study of the risk factors for Myopia (SCORM) and simulated data from Affymetrix 6.0 and platform-independent distributions. The normalized singleton ratio (NSR) is proposed as a metric for parameter optimization before enacting full analysis. We used 10 SCORM samples for optimizing parameter settings for each method and then evaluated method performance at optimal parameters using 100 SCORM samples. The statistical power, false positive rates, and receiver operating characteristic (ROC) curve residuals were evaluated by simulation studies. Optimal parameters, as determined by NSR and ROC curve residuals, were consistent across datasets. QuantiSNP outperformed other methods based on ROC curve residuals over most datasets. Nexus Rank and SNPRank have low specificity and high power. Nexus Rank calls oversized CNVs. PennCNV detects one of the fewest numbers of CNVs.

Full Text

Duke Authors

Cited Authors

  • Dellinger, AE; Saw, S-M; Goh, LK; Seielstad, M; Young, TL; Li, Y-J

Published Date

  • May 2010

Published In

Volume / Issue

  • 38 / 9

Start / End Page

  • e105 -

PubMed ID

  • 20142258

Electronic International Standard Serial Number (EISSN)

  • 1362-4962

Digital Object Identifier (DOI)

  • 10.1093/nar/gkq040

Language

  • eng

Conference Location

  • England