Hybrid error correction and de novo assembly of single-molecule sequencing reads.

Publication, Journal Article

Koren, S; Schatz, MC; Walenz, BP; Martin, J; Howard, JT; Ganapathy, G; Wang, Z; Rasko, DA; McCombie, WR; Jarvis, ED; Phillippy, AM

Published in: Nat Biotechnol

July 1, 2012


Single-molecule sequencing instruments can generate multikilobase sequences with the potential to greatly improve genome and transcriptome assembly. However, the error rates of single-molecule reads are high, which has limited their use thus far to resequencing bacteria. To address this limitation, we introduce a correction algorithm and assembly strategy that uses short, high-fidelity sequences to correct the error in single-molecule sequences. We demonstrate the utility of this approach on reads generated by a PacBio RS instrument from phage, prokaryotic and eukaryotic whole genomes, including the previously unsequenced genome of the parrot Melopsittacus undulatus, as well as for RNA-Seq reads of the corn (Zea mays) transcriptome. Our long-read correction achieves >99.9% base-call accuracy, leading to substantially better assemblies than current sequencing strategies: in the best example, the median contig size was quintupled relative to high-coverage, second-generation assemblies. Greater gains are predicted if read lengths continue to increase, including the prospect of single-contig bacterial chromosome assembly.
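The core idea in the abstract, using accurate short reads to vote on each base of a noisy long read, can be illustrated with a toy sketch. This is not the paper's algorithm: it assumes the short reads are already aligned to the long read at known offsets and that all errors are substitutions, whereas real single-molecule reads also contain insertions and deletions. The function name `correct_long_read` and the `(offset, sequence)` input format are hypothetical, chosen only for illustration.

```python
from collections import Counter


def correct_long_read(long_read, aligned_short_reads):
    """Toy substitution-only consensus correction (illustrative assumption).

    aligned_short_reads is a list of (offset, sequence) pairs, where offset
    is the 0-based position in long_read at which the short read is assumed
    to align. Alignment itself is out of scope for this sketch.
    """
    # One vote counter per position of the long read.
    votes = [Counter() for _ in long_read]
    for offset, seq in aligned_short_reads:
        for i, base in enumerate(seq):
            pos = offset + i
            if 0 <= pos < len(long_read):
                votes[pos][base] += 1

    corrected = []
    for original_base, counter in zip(long_read, votes):
        if counter:
            # The most frequent short-read base at this position wins.
            corrected.append(counter.most_common(1)[0][0])
        else:
            # No short-read coverage: keep the long-read base unchanged.
            corrected.append(original_base)
    return "".join(corrected)


# Hypothetical example: the noisy long read has two substitution errors.
noisy = "ACGAACGTTC"
short_reads = [(0, "ACGTA"), (2, "GTACG"), (5, "CGTAC"), (6, "GTAC")]
print(correct_long_read(noisy, short_reads))
```

Positions without short-read coverage are left as they are, so in this sketch the achievable accuracy depends on how evenly the short reads cover the long read.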

Duke Scholars

Erich David Jarvis (Author), Neurobiology

Published In

Nat Biotechnol

DOI

10.1038/nbt.2280

EISSN

1546-1696

Publication Date

July 1, 2012

Volume

30

Issue

7

Start / End Page

693 / 700

Location

United States

Related Subject Headings

Zea mays
Transcriptome
Sequence Analysis, RNA
RNA
Computational Biology
Bacteriophages
Bacteria
Algorithms

Citation

APA

Koren, S., Schatz, M. C., Walenz, B. P., Martin, J., Howard, J. T., Ganapathy, G., … Phillippy, A. M. (2012). Hybrid error correction and de novo assembly of single-molecule sequencing reads. Nat Biotechnol, 30(7), 693–700. https://doi.org/10.1038/nbt.2280

Chicago

Koren, Sergey, Michael C. Schatz, Brian P. Walenz, Jeffrey Martin, Jason T. Howard, Ganeshkumar Ganapathy, Zhong Wang, et al. “Hybrid error correction and de novo assembly of single-molecule sequencing reads.” Nat Biotechnol 30, no. 7 (July 1, 2012): 693–700. https://doi.org/10.1038/nbt.2280.

ICMJE

Koren S, Schatz MC, Walenz BP, Martin J, Howard JT, Ganapathy G, et al. Hybrid error correction and de novo assembly of single-molecule sequencing reads. Nat Biotechnol. 2012 Jul 1;30(7):693–700.

MLA

Koren, Sergey, et al. “Hybrid error correction and de novo assembly of single-molecule sequencing reads.” Nat Biotechnol, vol. 30, no. 7, July 2012, pp. 693–700. PubMed, doi:10.1038/nbt.2280.

NLM

Koren S, Schatz MC, Walenz BP, Martin J, Howard JT, Ganapathy G, Wang Z, Rasko DA, McCombie WR, Jarvis ED, Phillippy AM. Hybrid error correction and de novo assembly of single-molecule sequencing reads. Nat Biotechnol. 2012 Jul 1;30(7):693–700.
