Sequential modeling for identifying CpG island locations in human genome


Journal Article (Letter)

We consider several sequential processing algorithms for identifying genes in human DNA, based on detecting CpG ("C precedes G") islands. The algorithms are designed to capture the underlying statistical structure of a DNA sequence. Sequential processing with a Markov model and with a hidden Markov model is shown to identify most CpG islands in annotated (marked) DNA subsequences drawn from publicly available datasets. We also consider a wavelet-based hidden Markov tree (HMT). In the context of the HMT, we address the design of adaptive wavelets matched to CpG islands, accomplished via lifting and genetic-algorithm optimization.
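The Markov-model idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the common textbook formulation in which one first-order Markov chain is trained on island sequence, another on background sequence, and a candidate window is scored by the log-odds of the two. The training strings, the function names `train_markov` and `log_odds`, and the add-one pseudocount are all hypothetical choices made for illustration.

```python
import math

ALPHABET = "ACGT"

def train_markov(seqs, pseudocount=1.0):
    """Estimate first-order transition probabilities P(b | a).

    A pseudocount is added to every transition so that no
    probability is zero (an assumption of this sketch).
    """
    counts = {a: {b: pseudocount for b in ALPHABET} for a in ALPHABET}
    for s in seqs:
        for a, b in zip(s, s[1:]):
            counts[a][b] += 1
    return {
        a: {b: counts[a][b] / sum(counts[a].values()) for b in ALPHABET}
        for a in ALPHABET
    }

def log_odds(seq, island, background):
    """Sum of log P_island(b|a) / P_background(b|a) over adjacent pairs.

    Positive scores favor the island model, negative the background.
    """
    return sum(
        math.log(island[a][b] / background[a][b])
        for a, b in zip(seq, seq[1:])
    )

# Toy, hand-made training strings (hypothetical, not real genomic data).
island_train = ["CGCGGCGCCGCGACGCGTCG", "GCGCGCCGGCGCGCGTACGC"]
background_train = ["ATTATAAGCTTTAATCAGTA", "TTAGATCATTTGAAACTTAT"]

island = train_markov(island_train)
background = train_markov(background_train)

cg_rich_score = log_odds("CGCGCG", island, background)
at_rich_score = log_odds("ATATAT", island, background)
print(f"CG-rich window: {cg_rich_score:.2f}")
print(f"AT-rich window: {at_rich_score:.2f}")
```

With these toy inputs the CG-rich window scores above zero and the AT-rich window below zero. A hidden Markov model extends the same idea by treating island and background as hidden states and inferring the boundaries between them, rather than scoring fixed windows.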

Cited Authors

  • Dasgupta, N; Lin, S; Carin, L

Published Date

  • December 1, 2002

Published In

  • IEEE Signal Processing Letters

Volume / Issue

  • 9 / 12

Start / End Page

  • 407 - 409

International Standard Serial Number (ISSN)

  • 1070-9908

Digital Object Identifier (DOI)

  • 10.1109/LSP.2002.806062

Citation Source

  • Scopus