Scholars@Duke publication: On nearest-neighbor Gaussian process models for massive spatial data.

On nearest-neighbor Gaussian process models for massive spatial data.

Publication , Journal Article

Datta, A; Banerjee, S; Finley, AO; Gelfand, AE

Published in: Wiley interdisciplinary reviews. Computational statistics

September 2016

Gaussian Process (GP) models provide a very flexible nonparametric approach to modeling location-and-time indexed datasets. However, the storage and computational requirements for GP models are infeasible for large spatial datasets. Nearest Neighbor Gaussian Processes (Datta A, Banerjee S, Finley AO, Gelfand AE. Hierarchical nearest-neighbor gaussian process models for large geostatistical datasets. J Am Stat Assoc 2016., JASA) provide a scalable alternative by using local information from few nearest neighbors. Scalability is achieved by using the neighbor sets in a conditional specification of the model. We show how this is equivalent to sparse modeling of Cholesky factors of large covariance matrices. We also discuss a general approach to construct scalable Gaussian Processes using sparse local kriging. We present a multivariate data analysis which demonstrates how the nearest neighbor approach yields inference indistinguishable from the full rank GP despite being several times faster. Finally, we also propose a variant of the NNGP model for automating the selection of the neighbor set size.

Duke Scholars

Author Alan E. Gelfand Statistical Science

Published In

Wiley interdisciplinary reviews. Computational statistics

DOI

10.1002/wics.1383

EISSN

1939-0068

ISSN

1939-5108

Publication Date

September 2016

Volume

Issue

Start / End Page

162 / 171

Related Subject Headings

4905 Statistics
4605 Data management and data science
0802 Computation Theory and Mathematics
0104 Statistics
0102 Applied Mathematics

Citation

APA

Chicago

ICMJE

MLA

NLM

Datta, A., Banerjee, S., Finley, A. O., & Gelfand, A. E. (2016). On nearest-neighbor Gaussian process models for massive spatial data. Wiley Interdisciplinary Reviews. Computational Statistics, 8(5), 162–171. https://doi.org/10.1002/wics.1383

Datta, Abhirup, Sudipto Banerjee, Andrew O. Finley, and Alan E. Gelfand. “On nearest-neighbor Gaussian process models for massive spatial data.” Wiley Interdisciplinary Reviews. Computational Statistics 8, no. 5 (September 2016): 162–71. https://doi.org/10.1002/wics.1383.

Datta A, Banerjee S, Finley AO, Gelfand AE. On nearest-neighbor Gaussian process models for massive spatial data. Wiley interdisciplinary reviews Computational statistics. 2016 Sep;8(5):162–71.

Datta, Abhirup, et al. “On nearest-neighbor Gaussian process models for massive spatial data.” Wiley Interdisciplinary Reviews. Computational Statistics, vol. 8, no. 5, Sept. 2016, pp. 162–71. Epmc, doi:10.1002/wics.1383.

Datta A, Banerjee S, Finley AO, Gelfand AE. On nearest-neighbor Gaussian process models for massive spatial data. Wiley interdisciplinary reviews Computational statistics. 2016 Sep;8(5):162–171.

Published In

Wiley interdisciplinary reviews. Computational statistics

DOI

10.1002/wics.1383

EISSN

1939-0068

ISSN

1939-5108

Publication Date

September 2016

Volume

Issue

Start / End Page

162 / 171

Related Subject Headings

4905 Statistics
4605 Data management and data science
0802 Computation Theory and Mathematics
0104 Statistics
0102 Applied Mathematics