An algorithm for finding substantially broken repeated sequences in newly sequenced genomes

Conference Paper

Interspersed repeats occupy a significant fraction of many eukaryotic genomes. They result from the activity and accumulation of transposable elements, sequences which are able to replicate in virtually all organisms and which have been successfully maintained through the evolution. With the increasing availability of higher eukaryotic genomes, the identification and annotation of repeats has become an important task in genome biology and it has provoked a shift from the study of individual elements to their genome-wide distributions. In this paper we present a new method for de novo identification of repetitive segments in a genome, particularly suitable to identify these present in large copy numbers but which have diverged so much that they cannot be recognized by existing techniques, generally relying on relatively high sequence similarity between the copies. © 2008 American Institute of Physics.

Full Text

Duke Authors

Cited Authors

  • Singh, A; Stojanovic, N

Published Date

  • December 1, 2007

Published In

Volume / Issue

  • 971 /

Start / End Page

  • 79 - 88

Electronic International Standard Serial Number (EISSN)

  • 1551-7616

International Standard Serial Number (ISSN)

  • 0094-243X

International Standard Book Number 13 (ISBN-13)

  • 9780735404892

Digital Object Identifier (DOI)

  • 10.1063/1.2883871

Citation Source

  • Scopus