Skip to main content

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets

Publication ,  Journal Article
Datta, A; Banerjee, S; Finley, AO; Gelfand, AE
Published in: Journal of the American Statistical Association
April 2, 2016

Spatial process models for analyzing geostatistical data entail computations that become prohibitive as the number of spatial locations become large. This article develops a class of highly scalable nearest-neighbor Gaussian process (NNGP) models to provide fully model-based inference for large geostatistical datasets. We establish that the NNGP is a well-defined spatial process providing legitimate finite-dimensional Gaussian densities with sparse precision matrices. We embed the NNGP as a sparsity-inducing prior within a rich hierarchical modeling framework and outline how computationally efficient Markov chain Monte Carlo (MCMC) algorithms can be executed without storing or decomposing large matrices. The floating point operations (flops) per iteration of this algorithm is linear in the number of spatial locations, thereby rendering substantial scalability. We illustrate the computational and inferential benefits of the NNGP over competing methods using simulation studies and also analyze forest biomass from a massive U.S. Forest Inventory dataset at a scale that precludes alternative dimension-reducing methods. Supplementary materials for this article are available online.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Journal of the American Statistical Association

DOI

EISSN

1537-274X

ISSN

0162-1459

Publication Date

April 2, 2016

Volume

111

Issue

514

Start / End Page

800 / 812

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 3802 Econometrics
  • 1603 Demography
  • 1403 Econometrics
  • 0104 Statistics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Datta, A., Banerjee, S., Finley, A. O., & Gelfand, A. E. (2016). Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets. Journal of the American Statistical Association, 111(514), 800–812. https://doi.org/10.1080/01621459.2015.1044091
Datta, A., S. Banerjee, A. O. Finley, and A. E. Gelfand. “Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets.” Journal of the American Statistical Association 111, no. 514 (April 2, 2016): 800–812. https://doi.org/10.1080/01621459.2015.1044091.
Datta A, Banerjee S, Finley AO, Gelfand AE. Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets. Journal of the American Statistical Association. 2016 Apr 2;111(514):800–12.
Datta, A., et al. “Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets.” Journal of the American Statistical Association, vol. 111, no. 514, Apr. 2016, pp. 800–12. Scopus, doi:10.1080/01621459.2015.1044091.
Datta A, Banerjee S, Finley AO, Gelfand AE. Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets. Journal of the American Statistical Association. 2016 Apr 2;111(514):800–812.

Published In

Journal of the American Statistical Association

DOI

EISSN

1537-274X

ISSN

0162-1459

Publication Date

April 2, 2016

Volume

111

Issue

514

Start / End Page

800 / 812

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 3802 Econometrics
  • 1603 Demography
  • 1403 Econometrics
  • 0104 Statistics