Scholars@Duke publication: New techniques for DNA sequence classification.

New techniques for DNA sequence classification.

Publication , Journal Article

Wang, JT; Rozen, S; Shapiro, BA; Shasha, D; Wang, Z; Yin, M

Published in: J Comput Biol

1999

DNA sequence classification is the activity of determining whether or not an unlabeled sequence S belongs to an existing class C. This paper proposes two new techniques for DNA sequence classification. The first technique works by comparing the unlabeled sequence S with a group of active motifs discovered from the elements of C and by distinction with elements outside of C. The second technique generates and matches gapped fingerprints of S with elements of C. Experimental results obtained by running these algorithms on long and well conserved Alu sequences demonstrate the good performance of the presented methods compared with FASTA. When applied to less conserved and relatively short functional sites such as splice-junctions, a variation of the second technique combining fingerprinting with consensus sequence analysis gives better results than the current classifiers employing text compression and machine learning algorithms.

Duke Scholars

Author Steven George Rozen Biostatistics & Bioinformatics, Division of Integrative Geno ...

Published In

J Comput Biol

DOI

10.1089/cmb.1999.6.209

ISSN

1066-5277

Publication Date

1999

Volume

Issue

Start / End Page

209 / 218

Location

United States

Related Subject Headings

Software
Sequence Analysis, DNA
Regulatory Sequences, Nucleic Acid
RNA Splicing
Molecular Weight
False Negative Reactions
DNA Fingerprinting
DNA
Conserved Sequence
Consensus Sequence

Citation

APA

Chicago

ICMJE

MLA

NLM

Wang, J. T., Rozen, S., Shapiro, B. A., Shasha, D., Wang, Z., & Yin, M. (1999). New techniques for DNA sequence classification. J Comput Biol, 6(2), 209–218. https://doi.org/10.1089/cmb.1999.6.209

Wang, J. T., S. Rozen, B. A. Shapiro, D. Shasha, Z. Wang, and M. Yin. “New techniques for DNA sequence classification.” J Comput Biol 6, no. 2 (1999): 209–18. https://doi.org/10.1089/cmb.1999.6.209.

Wang JT, Rozen S, Shapiro BA, Shasha D, Wang Z, Yin M. New techniques for DNA sequence classification. J Comput Biol. 1999;6(2):209–18.

Wang, J. T., et al. “New techniques for DNA sequence classification.” J Comput Biol, vol. 6, no. 2, 1999, pp. 209–18. Pubmed, doi:10.1089/cmb.1999.6.209.

Wang JT, Rozen S, Shapiro BA, Shasha D, Wang Z, Yin M. New techniques for DNA sequence classification. J Comput Biol. 1999;6(2):209–218.

Published In

J Comput Biol

DOI

10.1089/cmb.1999.6.209

ISSN

1066-5277

Publication Date

1999

Volume

Issue

Start / End Page

209 / 218

Location

United States

Related Subject Headings

Software
Sequence Analysis, DNA
Regulatory Sequences, Nucleic Acid
RNA Splicing
Molecular Weight
False Negative Reactions
DNA Fingerprinting
DNA
Conserved Sequence
Consensus Sequence