Skip to main content

Predicting gene structure changes resulting from genetic variants via exon definition features.

Publication ,  Journal Article
Majoros, WH; Holt, C; Campbell, MS; Ware, D; Yandell, M; Reddy, TE
Published in: Bioinformatics
November 1, 2018

MOTIVATION: Genetic variation that disrupts gene function by altering gene splicing between individuals can substantially influence traits and disease. In those cases, accurately predicting the effects of genetic variation on splicing can be highly valuable for investigating the mechanisms underlying those traits and diseases. While methods have been developed to generate high quality computational predictions of gene structures in reference genomes, the same methods perform poorly when used to predict the potentially deleterious effects of genetic changes that alter gene splicing between individuals. Underlying that discrepancy in predictive ability are the common assumptions by reference gene finding algorithms that genes are conserved, well-formed and produce functional proteins. RESULTS: We describe a probabilistic approach for predicting recent changes to gene structure that may or may not conserve function. The model is applicable to both coding and non-coding genes, and can be trained on existing gene annotations without requiring curated examples of aberrant splicing. We apply this model to the problem of predicting altered splicing patterns in the genomes of individual humans, and we demonstrate that performing gene-structure prediction without relying on conserved coding features is feasible. The model predicts an unexpected abundance of variants that create de novo splice sites, an observation supported by both simulations and empirical data from RNA-seq experiments. While these de novo splice variants are commonly misinterpreted by other tools as coding or non-coding variants of little or no effect, we find that in some cases they can have large effects on splicing activity and protein products and we propose that they may commonly act as cryptic factors in disease. AVAILABILITY AND IMPLEMENTATION: The software is available from geneprediction.org/SGRF. SUPPLEMENTARY INFORMATION: Supplementary information is available at Bioinformatics online.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

November 1, 2018

Volume

34

Issue

21

Start / End Page

3616 / 3623

Location

England

Related Subject Headings

  • Software
  • Sequence Analysis, RNA
  • RNA Splicing
  • Molecular Sequence Annotation
  • Humans
  • Exons
  • Bioinformatics
  • 49 Mathematical sciences
  • 46 Information and computing sciences
  • 31 Biological sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Majoros, W. H., Holt, C., Campbell, M. S., Ware, D., Yandell, M., & Reddy, T. E. (2018). Predicting gene structure changes resulting from genetic variants via exon definition features. Bioinformatics, 34(21), 3616–3623. https://doi.org/10.1093/bioinformatics/bty324
Majoros, William H., Carson Holt, Michael S. Campbell, Doreen Ware, Mark Yandell, and Timothy E. Reddy. “Predicting gene structure changes resulting from genetic variants via exon definition features.Bioinformatics 34, no. 21 (November 1, 2018): 3616–23. https://doi.org/10.1093/bioinformatics/bty324.
Majoros WH, Holt C, Campbell MS, Ware D, Yandell M, Reddy TE. Predicting gene structure changes resulting from genetic variants via exon definition features. Bioinformatics. 2018 Nov 1;34(21):3616–23.
Majoros, William H., et al. “Predicting gene structure changes resulting from genetic variants via exon definition features.Bioinformatics, vol. 34, no. 21, Nov. 2018, pp. 3616–23. Pubmed, doi:10.1093/bioinformatics/bty324.
Majoros WH, Holt C, Campbell MS, Ware D, Yandell M, Reddy TE. Predicting gene structure changes resulting from genetic variants via exon definition features. Bioinformatics. 2018 Nov 1;34(21):3616–3623.

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

November 1, 2018

Volume

34

Issue

21

Start / End Page

3616 / 3623

Location

England

Related Subject Headings

  • Software
  • Sequence Analysis, RNA
  • RNA Splicing
  • Molecular Sequence Annotation
  • Humans
  • Exons
  • Bioinformatics
  • 49 Mathematical sciences
  • 46 Information and computing sciences
  • 31 Biological sciences