Skip to main content

High Dimensional Variable Selection with Error Control.

Publication ,  Journal Article
Kim, S; Halabi, S
Published in: Biomed Res Int
2016

Background. The iterative sure independence screening (ISIS) is a popular method in selecting important variables while maintaining most of the informative variables relevant to the outcome in high throughput data. However, it not only is computationally intensive but also may cause high false discovery rate (FDR). We propose to use the FDR as a screening method to reduce the high dimension to a lower dimension as well as controlling the FDR with three popular variable selection methods: LASSO, SCAD, and MCP. Method. The three methods with the proposed screenings were applied to prostate cancer data with presence of metastasis as the outcome. Results. Simulations showed that the three variable selection methods with the proposed screenings controlled the predefined FDR and produced high area under the receiver operating characteristic curve (AUROC) scores. In applying these methods to the prostate cancer example, LASSO and MCP selected 12 and 8 genes and produced AUROC scores of 0.746 and 0.764, respectively. Conclusions. We demonstrated that the variable selection methods with the sequential use of FDR and ISIS not only controlled the predefined FDR in the final models but also had relatively high AUROC scores.

Duke Scholars

Published In

Biomed Res Int

DOI

EISSN

2314-6141

Publication Date

2016

Volume

2016

Start / End Page

8209453

Location

United States

Related Subject Headings

  • Sensitivity and Specificity
  • Research Design
  • Reproducibility of Results
  • Regression Analysis
  • ROC Curve
  • Prostatic Neoplasms
  • Neoplasm Metastasis
  • Male
  • Humans
  • Gene Expression Regulation, Neoplastic
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Kim, S., & Halabi, S. (2016). High Dimensional Variable Selection with Error Control. Biomed Res Int, 2016, 8209453. https://doi.org/10.1155/2016/8209453
Kim, Sangjin, and Susan Halabi. “High Dimensional Variable Selection with Error Control.Biomed Res Int 2016 (2016): 8209453. https://doi.org/10.1155/2016/8209453.
Kim S, Halabi S. High Dimensional Variable Selection with Error Control. Biomed Res Int. 2016;2016:8209453.
Kim, Sangjin, and Susan Halabi. “High Dimensional Variable Selection with Error Control.Biomed Res Int, vol. 2016, 2016, p. 8209453. Pubmed, doi:10.1155/2016/8209453.
Kim S, Halabi S. High Dimensional Variable Selection with Error Control. Biomed Res Int. 2016;2016:8209453.

Published In

Biomed Res Int

DOI

EISSN

2314-6141

Publication Date

2016

Volume

2016

Start / End Page

8209453

Location

United States

Related Subject Headings

  • Sensitivity and Specificity
  • Research Design
  • Reproducibility of Results
  • Regression Analysis
  • ROC Curve
  • Prostatic Neoplasms
  • Neoplasm Metastasis
  • Male
  • Humans
  • Gene Expression Regulation, Neoplastic