Skip to main content
Journal cover image

Differential expression analysis for RNAseq using Poisson mixed models.

Publication ,  Journal Article
Sun, S; Hood, M; Scott, L; Peng, Q; Mukherjee, S; Tung, J; Zhou, X
Published in: Nucleic acids research
June 2017

Identifying differentially expressed (DE) genes from RNA sequencing (RNAseq) studies is among the most common analyses in genomics. However, RNAseq DE analysis presents several statistical and computational challenges, including over-dispersed read counts and, in some settings, sample non-independence. Previous count-based methods rely on simple hierarchical Poisson models (e.g. negative binomial) to model independent over-dispersion, but do not account for sample non-independence due to relatedness, population structure and/or hidden confounders. Here, we present a Poisson mixed model with two random effects terms that account for both independent over-dispersion and sample non-independence. We also develop a scalable sampling-based inference algorithm using a latent variable representation of the Poisson distribution. With simulations, we show that our method properly controls for type I error and is generally more powerful than other widely used approaches, except in small samples (n <15) with other unfavorable properties (e.g. small effect sizes). We also apply our method to three real datasets that contain related individuals, population stratification or hidden confounders. Our results show that our method increases power in all three data compared to other approaches, though the power gain is smallest in the smallest sample (n = 6). Our method is implemented in MACAU, freely available at www.xzlab.org/software.html.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Nucleic acids research

DOI

EISSN

1362-4962

ISSN

0305-1048

Publication Date

June 2017

Volume

45

Issue

11

Start / End Page

e106

Related Subject Headings

  • Software
  • Sequence Analysis, RNA
  • Poisson Distribution
  • Monte Carlo Method
  • Models, Genetic
  • Markov Chains
  • Linear Models
  • Humans
  • Gene Expression Profiling
  • Developmental Biology
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Sun, S., Hood, M., Scott, L., Peng, Q., Mukherjee, S., Tung, J., & Zhou, X. (2017). Differential expression analysis for RNAseq using Poisson mixed models. Nucleic Acids Research, 45(11), e106. https://doi.org/10.1093/nar/gkx204
Sun, Shiquan, Michelle Hood, Laura Scott, Qinke Peng, Sayan Mukherjee, Jenny Tung, and Xiang Zhou. “Differential expression analysis for RNAseq using Poisson mixed models.Nucleic Acids Research 45, no. 11 (June 2017): e106. https://doi.org/10.1093/nar/gkx204.
Sun S, Hood M, Scott L, Peng Q, Mukherjee S, Tung J, et al. Differential expression analysis for RNAseq using Poisson mixed models. Nucleic acids research. 2017 Jun;45(11):e106.
Sun, Shiquan, et al. “Differential expression analysis for RNAseq using Poisson mixed models.Nucleic Acids Research, vol. 45, no. 11, June 2017, p. e106. Epmc, doi:10.1093/nar/gkx204.
Sun S, Hood M, Scott L, Peng Q, Mukherjee S, Tung J, Zhou X. Differential expression analysis for RNAseq using Poisson mixed models. Nucleic acids research. 2017 Jun;45(11):e106.
Journal cover image

Published In

Nucleic acids research

DOI

EISSN

1362-4962

ISSN

0305-1048

Publication Date

June 2017

Volume

45

Issue

11

Start / End Page

e106

Related Subject Headings

  • Software
  • Sequence Analysis, RNA
  • Poisson Distribution
  • Monte Carlo Method
  • Models, Genetic
  • Markov Chains
  • Linear Models
  • Humans
  • Gene Expression Profiling
  • Developmental Biology