Modeling disease incidence data with spatial and spatio temporal dirichlet process mixtures.

Journal Article (Journal Article)

Disease incidence or mortality data are typically available as rates or counts for specified regions, collected over time. We propose Bayesian nonparametric spatial modeling approaches to analyze such data. We develop a hierarchical specification using spatial random effects modeled with a Dirichlet process prior. The Dirichlet process is centered around a multivariate normal distribution. This latter distribution arises from a log-Gaussian process model that provides a latent incidence rate surface, followed by block averaging to the areal units determined by the regions in the study. With regard to the resulting posterior predictive inference, the modeling approach is shown to be equivalent to an approach based on block averaging of a spatial Dirichlet process to obtain a prior probability model for the finite dimensional distribution of the spatial random effects. We introduce a dynamic formulation for the spatial random effects to extend the model to spatio-temporal settings. Posterior inference is implemented through Gibbs sampling. We illustrate the methodology with simulated data as well as with a data set on lung cancer incidences for all 88 counties in the state of Ohio over an observation period of 21 years.

Full Text

Duke Authors

Cited Authors

  • Kottas, A; Duan, JA; Gelfand, AE

Published Date

  • February 2008

Published In

Volume / Issue

  • 50 / 1

Start / End Page

  • 29 - 42

PubMed ID

  • 17926327

Electronic International Standard Serial Number (EISSN)

  • 1521-4036

International Standard Serial Number (ISSN)

  • 0323-3847

Digital Object Identifier (DOI)

  • 10.1002/bimj.200610375


  • eng