Hybrid error correction and de novo assembly of single-molecule sequencing reads.

Publication, Journal Article
Koren, S; Schatz, MC; Walenz, BP; Martin, J; Howard, JT; Ganapathy, G; Wang, Z; Rasko, DA; McCombie, WR; Jarvis, ED; Phillippy, AM
Published in: Nat Biotechnol
July 1, 2012

Single-molecule sequencing instruments can generate multikilobase sequences with the potential to greatly improve genome and transcriptome assembly. However, the error rates of single-molecule reads are high, which has limited their use thus far to resequencing bacteria. To address this limitation, we introduce a correction algorithm and assembly strategy that uses short, high-fidelity sequences to correct the error in single-molecule sequences. We demonstrate the utility of this approach on reads generated by a PacBio RS instrument from phage, prokaryotic and eukaryotic whole genomes, including the previously unsequenced genome of the parrot Melopsittacus undulatus, as well as for RNA-Seq reads of the corn (Zea mays) transcriptome. Our long-read correction achieves >99.9% base-call accuracy, leading to substantially better assemblies than current sequencing strategies: in the best example, the median contig size was quintupled relative to high-coverage, second-generation assemblies. Greater gains are predicted if read lengths continue to increase, including the prospect of single-contig bacterial chromosome assembly.

Published In

Nat Biotechnol

DOI

10.1038/nbt.2280

EISSN

1546-1696

Publication Date

July 1, 2012

Volume

30

Issue

7

Start / End Page

693 / 700

Location

United States

Related Subject Headings

  • Zea mays
  • Transcriptome
  • Sequence Analysis, RNA
  • RNA
  • Computational Biology
  • Bacteriophages
  • Bacteria
  • Algorithms
 

Citation

APA
Koren, S., Schatz, M. C., Walenz, B. P., Martin, J., Howard, J. T., Ganapathy, G., … Phillippy, A. M. (2012). Hybrid error correction and de novo assembly of single-molecule sequencing reads. Nat Biotechnol, 30(7), 693–700. https://doi.org/10.1038/nbt.2280

Chicago
Koren, Sergey, Michael C. Schatz, Brian P. Walenz, Jeffrey Martin, Jason T. Howard, Ganeshkumar Ganapathy, Zhong Wang, et al. “Hybrid error correction and de novo assembly of single-molecule sequencing reads.” Nat Biotechnol 30, no. 7 (July 1, 2012): 693–700. https://doi.org/10.1038/nbt.2280.

ICMJE
Koren S, Schatz MC, Walenz BP, Martin J, Howard JT, Ganapathy G, et al. Hybrid error correction and de novo assembly of single-molecule sequencing reads. Nat Biotechnol. 2012 Jul 1;30(7):693–700.

MLA
Koren, Sergey, et al. “Hybrid error correction and de novo assembly of single-molecule sequencing reads.” Nat Biotechnol, vol. 30, no. 7, July 2012, pp. 693–700. Pubmed, doi:10.1038/nbt.2280.

NLM
Koren S, Schatz MC, Walenz BP, Martin J, Howard JT, Ganapathy G, Wang Z, Rasko DA, McCombie WR, Jarvis ED, Phillippy AM. Hybrid error correction and de novo assembly of single-molecule sequencing reads. Nat Biotechnol. 2012 Jul 1;30(7):693–700.
