Skip to main content

Bayesian analysis of dynamic linear topic models

Publication ,  Journal Article
Glynn, C; Tokdar, ST; Howard, B; Banks, DL
Published in: Bayesian Analysis
January 1, 2019

Discovering temporal evolution of themes from a time-stamped collection of text poses a challenging statistical learning problem. Dynamic topic models offer a probabilistic modeling framework to decompose a corpus of text documents into "topics", i.e., probability distributions over vocabulary terms, while simultaneously learning the temporal dynamics of the relative prevalence of these topics. We extend the dynamic topic model of Blei and Lafferty (2006) by fusing its multinomial factor model on topics with dynamic linear models that account for time trends and seasonality in topic prevalence. A Markov chain Monte Carlo (MCMC) algorithm that utilizes Pólya-Gamma data augmentation is developed for posterior sampling. Conditional independencies in the model and sampling are made explicit, and our MCMC algorithm is parallelized where possible to allow for inference in large corpora. Our model and inference algorithm are validated with multiple synthetic examples, and we consider the applied problem of modeling trends in real estate listings from the housing website Zillow. We demonstrate in synthetic examples that sharing information across documents is critical for accurately estimating document-specific topic proportions. Analysis of the Zillow corpus demonstrates that the method is able to learn seasonal patterns and locally linear trends in topic prevalence.

Duke Scholars

Published In

Bayesian Analysis

DOI

EISSN

1931-6690

ISSN

1936-0975

Publication Date

January 1, 2019

Volume

14

Issue

1

Start / End Page

53 / 80

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 0104 Statistics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Glynn, C., Tokdar, S. T., Howard, B., & Banks, D. L. (2019). Bayesian analysis of dynamic linear topic models. Bayesian Analysis, 14(1), 53–80. https://doi.org/10.1214/18-BA1100
Glynn, C., S. T. Tokdar, B. Howard, and D. L. Banks. “Bayesian analysis of dynamic linear topic models.” Bayesian Analysis 14, no. 1 (January 1, 2019): 53–80. https://doi.org/10.1214/18-BA1100.
Glynn C, Tokdar ST, Howard B, Banks DL. Bayesian analysis of dynamic linear topic models. Bayesian Analysis. 2019 Jan 1;14(1):53–80.
Glynn, C., et al. “Bayesian analysis of dynamic linear topic models.” Bayesian Analysis, vol. 14, no. 1, Jan. 2019, pp. 53–80. Scopus, doi:10.1214/18-BA1100.
Glynn C, Tokdar ST, Howard B, Banks DL. Bayesian analysis of dynamic linear topic models. Bayesian Analysis. 2019 Jan 1;14(1):53–80.

Published In

Bayesian Analysis

DOI

EISSN

1931-6690

ISSN

1936-0975

Publication Date

January 1, 2019

Volume

14

Issue

1

Start / End Page

53 / 80

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 0104 Statistics