Complexity reduction in context-dependent DNA substitution models.
MOTIVATION: The modeling of conservation patterns in genomic DNA has become increasingly popular for a number of bioinformatic applications. While several systems developed to date incorporate context-dependence in their substitution models, the impact on computational complexity and generalization ability of the resulting higher order models invites the question of whether simpler approaches to context modeling might permit appreciable reductions in model complexity and computational cost, without sacrificing prediction accuracy. RESULTS: We formulate several alternative methods for context modeling based on windowed Bayesian networks, and compare their effects on both accuracy and computational complexity for the task of discriminating functionally distinct segments in vertebrate DNA. Our results show that substantial reductions in the complexity of both the model and the associated inference algorithm can be achieved without reducing predictive accuracy.
Duke Scholars
Published In
DOI
EISSN
Publication Date
Volume
Issue
Start / End Page
Location
Related Subject Headings
- Software
- Sequence Analysis, DNA
- Models, Genetic
- Genome
- DNA
- Computer Simulation
- Bioinformatics
- Bayes Theorem
- Algorithms
- 49 Mathematical sciences
Citation
Published In
DOI
EISSN
Publication Date
Volume
Issue
Start / End Page
Location
Related Subject Headings
- Software
- Sequence Analysis, DNA
- Models, Genetic
- Genome
- DNA
- Computer Simulation
- Bioinformatics
- Bayes Theorem
- Algorithms
- 49 Mathematical sciences