Skip to main content
Journal cover image

Experimental analysis of sources of error in evolutionary studies based on Roche/454 pyrosequencing of viral genomes.

Publication ,  Journal Article
Becker, EA; Burns, CM; León, EJ; Rajabojan, S; Friedman, R; Friedrich, TC; O'Connor, SL; Hughes, AL
Published in: Genome Biol Evol
2012

Factors affecting the reliability of Roche/454 pyrosequencing for analyzing sequence polymorphism in within-host viral populations were assessed by two experiments: 1) sequencing four clonal simian immunodeficiency virus (SIV) stocks and 2) sequencing mixtures in different proportions of two SIV strains with known fixed nucleotide differences. Observed nucleotide diversity and frequency of undetermined nucleotides were increased at sites in homopolymer runs of four or more identical nucleotides, particularly at AT sites. However, in the mixed-strain experiments, the effects on estimated nucleotide diversity of such errors were small in comparison to known strain differences. The results suggest that biologically meaningful variants present at a frequency of around 10% and possibly much lower are easily distinguished from artifacts of the sequencing process. Analysis of the clonal stocks revealed numerous rare variants that showed the signature of purifying selection and that elimination of variants at frequencies of less than 1% reduced estimates of nucleotide diversity by about an order of magnitude. Thus, using a 1% frequency cutoff for accepting a variant as real represents a conservative standard, which may be useful in studies that are focused on the discovery of specific mutations (such as those conferring immune escape or drug resistance). On the other hand, if the goal is to estimate nucleotide diversity, an optimal strategy might be to include all observed variants (even those at less than 1% frequency), while masking out homopolymer runs of four or more nucleotides.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Genome Biol Evol

DOI

EISSN

1759-6653

Publication Date

2012

Volume

4

Issue

4

Start / End Page

457 / 465

Location

England

Related Subject Headings

  • Simian immunodeficiency virus
  • Simian Immunodeficiency Virus
  • Simian Acquired Immunodeficiency Syndrome
  • Sequence Analysis, DNA
  • Genome, Viral
  • Genetic Variation
  • Evolution, Molecular
  • Developmental Biology
  • Animals
  • 3105 Genetics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Becker, E. A., Burns, C. M., León, E. J., Rajabojan, S., Friedman, R., Friedrich, T. C., … Hughes, A. L. (2012). Experimental analysis of sources of error in evolutionary studies based on Roche/454 pyrosequencing of viral genomes. Genome Biol Evol, 4(4), 457–465. https://doi.org/10.1093/gbe/evs029
Becker, Ericka A., Charles M. Burns, Enrique J. León, Saravanan Rajabojan, Robert Friedman, Thomas C. Friedrich, Shelby L. O’Connor, and Austin L. Hughes. “Experimental analysis of sources of error in evolutionary studies based on Roche/454 pyrosequencing of viral genomes.Genome Biol Evol 4, no. 4 (2012): 457–65. https://doi.org/10.1093/gbe/evs029.
Becker EA, Burns CM, León EJ, Rajabojan S, Friedman R, Friedrich TC, et al. Experimental analysis of sources of error in evolutionary studies based on Roche/454 pyrosequencing of viral genomes. Genome Biol Evol. 2012;4(4):457–65.
Becker, Ericka A., et al. “Experimental analysis of sources of error in evolutionary studies based on Roche/454 pyrosequencing of viral genomes.Genome Biol Evol, vol. 4, no. 4, 2012, pp. 457–65. Pubmed, doi:10.1093/gbe/evs029.
Becker EA, Burns CM, León EJ, Rajabojan S, Friedman R, Friedrich TC, O’Connor SL, Hughes AL. Experimental analysis of sources of error in evolutionary studies based on Roche/454 pyrosequencing of viral genomes. Genome Biol Evol. 2012;4(4):457–465.
Journal cover image

Published In

Genome Biol Evol

DOI

EISSN

1759-6653

Publication Date

2012

Volume

4

Issue

4

Start / End Page

457 / 465

Location

England

Related Subject Headings

  • Simian immunodeficiency virus
  • Simian Immunodeficiency Virus
  • Simian Acquired Immunodeficiency Syndrome
  • Sequence Analysis, DNA
  • Genome, Viral
  • Genetic Variation
  • Evolution, Molecular
  • Developmental Biology
  • Animals
  • 3105 Genetics