Skip to main content
Journal cover image

Improving the performance of predictive process modeling for large datasets.

Publication ,  Journal Article
Finley, AO; Sang, H; Banerjee, S; Gelfand, AE
Published in: Computational statistics & data analysis
June 2009

Advances in Geographical Information Systems (GIS) and Global Positioning Systems (GPS) enable accurate geocoding of locations where scientific data are collected. This has encouraged collection of large spatial datasets in many fields and has generated considerable interest in statistical modeling for location-referenced spatial data. The setting where the number of locations yielding observations is too large to fit the desired hierarchical spatial random effects models using Markov chain Monte Carlo methods is considered. This problem is exacerbated in spatial-temporal and multivariate settings where many observations occur at each location. The recently proposed predictive process, motivated by kriging ideas, aims to maintain the richness of desired hierarchical spatial modeling specifications in the presence of large datasets. A shortcoming of the original formulation of the predictive process is that it induces a positive bias in the non-spatial error term of the models. A modified predictive process is proposed to address this problem. The predictive process approach is knot-based leading to questions regarding knot design. An algorithm is designed to achieve approximately optimal spatial placement of knots. Detailed illustrations of the modified predictive process using multivariate spatial regression with both a simulated and a real dataset are offered.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Computational statistics & data analysis

DOI

EISSN

1872-7352

ISSN

0167-9473

Publication Date

June 2009

Volume

53

Issue

8

Start / End Page

2873 / 2884

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 3802 Econometrics
  • 1403 Econometrics
  • 0802 Computation Theory and Mathematics
  • 0104 Statistics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Finley, A. O., Sang, H., Banerjee, S., & Gelfand, A. E. (2009). Improving the performance of predictive process modeling for large datasets. Computational Statistics & Data Analysis, 53(8), 2873–2884. https://doi.org/10.1016/j.csda.2008.09.008
Finley, Andrew O., Huiyan Sang, Sudipto Banerjee, and Alan E. Gelfand. “Improving the performance of predictive process modeling for large datasets.Computational Statistics & Data Analysis 53, no. 8 (June 2009): 2873–84. https://doi.org/10.1016/j.csda.2008.09.008.
Finley AO, Sang H, Banerjee S, Gelfand AE. Improving the performance of predictive process modeling for large datasets. Computational statistics & data analysis. 2009 Jun;53(8):2873–84.
Finley, Andrew O., et al. “Improving the performance of predictive process modeling for large datasets.Computational Statistics & Data Analysis, vol. 53, no. 8, June 2009, pp. 2873–84. Epmc, doi:10.1016/j.csda.2008.09.008.
Finley AO, Sang H, Banerjee S, Gelfand AE. Improving the performance of predictive process modeling for large datasets. Computational statistics & data analysis. 2009 Jun;53(8):2873–2884.
Journal cover image

Published In

Computational statistics & data analysis

DOI

EISSN

1872-7352

ISSN

0167-9473

Publication Date

June 2009

Volume

53

Issue

8

Start / End Page

2873 / 2884

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 3802 Econometrics
  • 1403 Econometrics
  • 0802 Computation Theory and Mathematics
  • 0104 Statistics