Skip to main content

Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation.

Publication ,  Journal Article
Hafez, D; Ni, T; Mukherjee, S; Zhu, J; Ohler, U
Published in: Bioinformatics
July 1, 2013

MOTIVATION: Pre-mRNA cleavage and polyadenylation are essential steps for 3'-end maturation and subsequent stability and degradation of mRNAs. This process is highly controlled by cis-regulatory elements surrounding the cleavage/polyadenylation sites (polyA sites), which are frequently constrained by sequence content and position. More than 50% of human transcripts have multiple functional polyA sites, and the specific use of alternative polyA sites (APA) results in isoforms with variable 3'-untranslated regions, thus potentially affecting gene regulation. Elucidating the regulatory mechanisms underlying differential polyA preferences in multiple cell types has been hindered both by the lack of suitable data on the precise location of cleavage sites, as well as of appropriate tests for determining APAs with significant differences across multiple libraries. RESULTS: We applied a tailored paired-end RNA-seq protocol to specifically probe the position of polyA sites in three human adult tissue types. We specified a linear-effects regression model to identify tissue-specific biases indicating regulated APA; the significance of differences between tissue types was assessed by an appropriately designed permutation test. This combination allowed to identify highly specific subsets of APA events in the individual tissue types. Predictive models successfully classified constitutive polyA sites from a biologically relevant background (auROC = 99.6%), as well as tissue-specific regulated sets from each other. We found that the main cis-regulatory elements described for polyadenylation are a strong, and highly informative, hallmark for constitutive sites only. Tissue-specific regulated sites were found to contain other regulatory motifs, with the canonical polyadenylation signal being nearly absent at brain-specific polyA sites. Together, our results contribute to the understanding of the diversity of post-transcriptional gene regulation. AVAILABILITY: Raw data are deposited on SRA, accession numbers: brain SRX208132, kidney SRX208087 and liver SRX208134. Processed datasets as well as model code are published on our website: http://www.genome.duke.edu/labs/ohler/research/UTR/. CONTACT: uwe.ohler@duke.edu.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

July 1, 2013

Volume

29

Issue

13

Start / End Page

i108 / i116

Location

England

Related Subject Headings

  • Sequence Analysis, RNA
  • Regulatory Sequences, Ribonucleic Acid
  • Polyadenylation
  • Poly A
  • Organ Specificity
  • Linear Models
  • Humans
  • Genomics
  • Bioinformatics
  • Adult
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Hafez, D., Ni, T., Mukherjee, S., Zhu, J., & Ohler, U. (2013). Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation. Bioinformatics, 29(13), i108–i116. https://doi.org/10.1093/bioinformatics/btt233
Hafez, Dina, Ting Ni, Sayan Mukherjee, Jun Zhu, and Uwe Ohler. “Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation.Bioinformatics 29, no. 13 (July 1, 2013): i108–16. https://doi.org/10.1093/bioinformatics/btt233.
Hafez D, Ni T, Mukherjee S, Zhu J, Ohler U. Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation. Bioinformatics. 2013 Jul 1;29(13):i108–16.
Hafez, Dina, et al. “Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation.Bioinformatics, vol. 29, no. 13, July 2013, pp. i108–16. Pubmed, doi:10.1093/bioinformatics/btt233.
Hafez D, Ni T, Mukherjee S, Zhu J, Ohler U. Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation. Bioinformatics. 2013 Jul 1;29(13):i108–i116.

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

July 1, 2013

Volume

29

Issue

13

Start / End Page

i108 / i116

Location

England

Related Subject Headings

  • Sequence Analysis, RNA
  • Regulatory Sequences, Ribonucleic Acid
  • Polyadenylation
  • Poly A
  • Organ Specificity
  • Linear Models
  • Humans
  • Genomics
  • Bioinformatics
  • Adult