Skip to main content
Journal cover image

Comparing methods for multivariate nonparametric regression

Publication ,  Journal Article
Banks, DL; Olszewski, RT; Maxion, RA
Published in: Communications in Statistics Part B: Simulation and Computation
May 1, 2003

The ever-growing number of high-dimensional, superlarge databases requires effective analysis techniques to mine interesting information from the data. Development of new-wave methodologies for high-dimensional nonparametric regression has exploded over the last decade in an effort to meet these analysis demands. This article reports on an extensive simulation experiment that compares the performance of ten different, commonly-used regression techniques: linear regression, stepwise linear regression, additive models (AM), projection pursuit regression (PPR), recursive partitioning regression (RPR), multivariate adaptive regression splines (MARS), alternating conditional expectations (ACE), additivity and variance stabilization (AVAS), locally weighted regression (LOESS), and neural networks. Each regression technique was used to analyze multiple datasets each having a unique embedded structure; the accuracy of each technique was determined by its ability to correctly identify the embedded structure averaged over all the datasets. Datasets used in the experiment were constructed so as to have particular properties which varied across the datasets in order to determine each technique's accuracy within various environments. The dataset properties which were varied include dimension of the data, the true dimension of the embedded structure, the sample size, the amount of noise, and the complexity of the embedded structure. Analyses of the results show that all of these properties affect the accuracy of each regression technique under investigation. A mapping from data characteristics to the most effective regression technique(s) is suggested.

Duke Scholars

Published In

Communications in Statistics Part B: Simulation and Computation

DOI

ISSN

0361-0918

Publication Date

May 1, 2003

Volume

32

Issue

2

Start / End Page

541 / 571

Related Subject Headings

  • Statistics & Probability
  • 49 Mathematical sciences
  • 46 Information and computing sciences
  • 08 Information and Computing Sciences
  • 01 Mathematical Sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Banks, D. L., Olszewski, R. T., & Maxion, R. A. (2003). Comparing methods for multivariate nonparametric regression. Communications in Statistics Part B: Simulation and Computation, 32(2), 541–571. https://doi.org/10.1081/SAC-120017506
Banks, D. L., R. T. Olszewski, and R. A. Maxion. “Comparing methods for multivariate nonparametric regression.” Communications in Statistics Part B: Simulation and Computation 32, no. 2 (May 1, 2003): 541–71. https://doi.org/10.1081/SAC-120017506.
Banks DL, Olszewski RT, Maxion RA. Comparing methods for multivariate nonparametric regression. Communications in Statistics Part B: Simulation and Computation. 2003 May 1;32(2):541–71.
Banks, D. L., et al. “Comparing methods for multivariate nonparametric regression.” Communications in Statistics Part B: Simulation and Computation, vol. 32, no. 2, May 2003, pp. 541–71. Scopus, doi:10.1081/SAC-120017506.
Banks DL, Olszewski RT, Maxion RA. Comparing methods for multivariate nonparametric regression. Communications in Statistics Part B: Simulation and Computation. 2003 May 1;32(2):541–571.
Journal cover image

Published In

Communications in Statistics Part B: Simulation and Computation

DOI

ISSN

0361-0918

Publication Date

May 1, 2003

Volume

32

Issue

2

Start / End Page

541 / 571

Related Subject Headings

  • Statistics & Probability
  • 49 Mathematical sciences
  • 46 Information and computing sciences
  • 08 Information and Computing Sciences
  • 01 Mathematical Sciences