A maximum-likelihood base caller for DNA sequencing.


Journal Article

The procedures used to sequence the human genome involve the electrophoretic separation of mixtures of dioxyribonucleic acid (DNA) fragments tagged with reporting groups, usually fluorescent dyes. Each fluorescent pulse which arrives from an optical detector corresponds to a nucleotide (base) in the DNA sequence, and the subsequent process of base detection is known as base calling. Generating longer and more accurate sequences in the base-calling process will reduce the high cost of DNA sequencing. This paper presents an automated base-calling algorithm, referred to as maximum-likelihood base caller (MLB), which is based on maximum likelihood equalization for digital communication channels. Based on 125 experimental datasets, MLB averaged up to 40% fewer errors than the widely used ABI base caller from the Applied Biosystems Division of PE Corporation. MLB's accuracy rivaled that of another well-known base caller, Phred, surpassing it on datasets with high background noise.

Full Text

Duke Authors

Cited Authors

  • Brady, D; Kocic, M; Miller, AW; Karger, BL

Published Date

  • September 2000

Published In

Volume / Issue

  • 47 / 9

Start / End Page

  • 1271 - 1280

PubMed ID

  • 11008429

Pubmed Central ID

  • 11008429

Electronic International Standard Serial Number (EISSN)

  • 1558-2531

International Standard Serial Number (ISSN)

  • 0018-9294

Digital Object Identifier (DOI)

  • 10.1109/10.867962


  • eng