Skip to main content
Journal cover image

On nearest-neighbor Gaussian process models for massive spatial data.

Publication ,  Journal Article
Datta, A; Banerjee, S; Finley, AO; Gelfand, AE
Published in: Wiley interdisciplinary reviews. Computational statistics
September 2016

Gaussian Process (GP) models provide a very flexible nonparametric approach to modeling location-and-time indexed datasets. However, the storage and computational requirements for GP models are infeasible for large spatial datasets. Nearest Neighbor Gaussian Processes (Datta A, Banerjee S, Finley AO, Gelfand AE. Hierarchical nearest-neighbor gaussian process models for large geostatistical datasets. J Am Stat Assoc 2016., JASA) provide a scalable alternative by using local information from few nearest neighbors. Scalability is achieved by using the neighbor sets in a conditional specification of the model. We show how this is equivalent to sparse modeling of Cholesky factors of large covariance matrices. We also discuss a general approach to construct scalable Gaussian Processes using sparse local kriging. We present a multivariate data analysis which demonstrates how the nearest neighbor approach yields inference indistinguishable from the full rank GP despite being several times faster. Finally, we also propose a variant of the NNGP model for automating the selection of the neighbor set size.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Wiley interdisciplinary reviews. Computational statistics

DOI

EISSN

1939-0068

ISSN

1939-5108

Publication Date

September 2016

Volume

8

Issue

5

Start / End Page

162 / 171

Related Subject Headings

  • 4905 Statistics
  • 4605 Data management and data science
  • 0802 Computation Theory and Mathematics
  • 0104 Statistics
  • 0102 Applied Mathematics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Datta, A., Banerjee, S., Finley, A. O., & Gelfand, A. E. (2016). On nearest-neighbor Gaussian process models for massive spatial data. Wiley Interdisciplinary Reviews. Computational Statistics, 8(5), 162–171. https://doi.org/10.1002/wics.1383
Datta, Abhirup, Sudipto Banerjee, Andrew O. Finley, and Alan E. Gelfand. “On nearest-neighbor Gaussian process models for massive spatial data.Wiley Interdisciplinary Reviews. Computational Statistics 8, no. 5 (September 2016): 162–71. https://doi.org/10.1002/wics.1383.
Datta A, Banerjee S, Finley AO, Gelfand AE. On nearest-neighbor Gaussian process models for massive spatial data. Wiley interdisciplinary reviews Computational statistics. 2016 Sep;8(5):162–71.
Datta, Abhirup, et al. “On nearest-neighbor Gaussian process models for massive spatial data.Wiley Interdisciplinary Reviews. Computational Statistics, vol. 8, no. 5, Sept. 2016, pp. 162–71. Epmc, doi:10.1002/wics.1383.
Datta A, Banerjee S, Finley AO, Gelfand AE. On nearest-neighbor Gaussian process models for massive spatial data. Wiley interdisciplinary reviews Computational statistics. 2016 Sep;8(5):162–171.
Journal cover image

Published In

Wiley interdisciplinary reviews. Computational statistics

DOI

EISSN

1939-0068

ISSN

1939-5108

Publication Date

September 2016

Volume

8

Issue

5

Start / End Page

162 / 171

Related Subject Headings

  • 4905 Statistics
  • 4605 Data management and data science
  • 0802 Computation Theory and Mathematics
  • 0104 Statistics
  • 0102 Applied Mathematics