Skip to main content
construction release_alert
Scholars@Duke will be undergoing maintenance April 11-15. Some features may be unavailable during this time.
cancel

Component-wise gradient boosting and false discovery control in survival analysis with high-dimensional covariates.

Publication ,  Journal Article
He, K; Li, Y; Zhu, J; Liu, H; Lee, JE; Amos, CI; Hyslop, T; Jin, J; Lin, H; Wei, Q; Li, Y
Published in: Bioinformatics
January 1, 2016

MOTIVATION: Technological advances that allow routine identification of high-dimensional risk factors have led to high demand for statistical techniques that enable full utilization of these rich sources of information for genetics studies. Variable selection for censored outcome data as well as control of false discoveries (i.e. inclusion of irrelevant variables) in the presence of high-dimensional predictors present serious challenges. This article develops a computationally feasible method based on boosting and stability selection. Specifically, we modified the component-wise gradient boosting to improve the computational feasibility and introduced random permutation in stability selection for controlling false discoveries. RESULTS: We have proposed a high-dimensional variable selection method by incorporating stability selection to control false discovery. Comparisons between the proposed method and the commonly used univariate and Lasso approaches for variable selection reveal that the proposed method yields fewer false discoveries. The proposed method is applied to study the associations of 2339 common single-nucleotide polymorphisms (SNPs) with overall survival among cutaneous melanoma (CM) patients. The results have confirmed that BRCA2 pathway SNPs are likely to be associated with overall survival, as reported by previous literature. Moreover, we have identified several new Fanconi anemia (FA) pathway SNPs that are likely to modulate survival of CM patients. AVAILABILITY AND IMPLEMENTATION: The related source code and documents are freely available at https://sites.google.com/site/bestumich/issues. CONTACT: yili@umich.edu.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

January 1, 2016

Volume

32

Issue

1

Start / End Page

50 / 57

Location

England

Related Subject Headings

  • Time Factors
  • Survival Analysis
  • Skin Neoplasms
  • Risk Factors
  • Polymorphism, Single Nucleotide
  • Melanoma, Cutaneous Malignant
  • Melanoma
  • Humans
  • Computer Simulation
  • Bioinformatics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
He, K., Li, Y., Zhu, J., Liu, H., Lee, J. E., Amos, C. I., … Wei, Q. (2016). Component-wise gradient boosting and false discovery control in survival analysis with high-dimensional covariates. Bioinformatics, 32(1), 50–57. https://doi.org/10.1093/bioinformatics/btv517
He, Kevin, Yanming Li, Ji Zhu, Hongliang Liu, Jeffrey E. Lee, Christopher I. Amos, Terry Hyslop, et al. “Component-wise gradient boosting and false discovery control in survival analysis with high-dimensional covariates.Bioinformatics 32, no. 1 (January 1, 2016): 50–57. https://doi.org/10.1093/bioinformatics/btv517.
He K, Li Y, Zhu J, Liu H, Lee JE, Amos CI, et al. Component-wise gradient boosting and false discovery control in survival analysis with high-dimensional covariates. Bioinformatics. 2016 Jan 1;32(1):50–7.
He, Kevin, et al. “Component-wise gradient boosting and false discovery control in survival analysis with high-dimensional covariates.Bioinformatics, vol. 32, no. 1, Jan. 2016, pp. 50–57. Pubmed, doi:10.1093/bioinformatics/btv517.
He K, Li Y, Zhu J, Liu H, Lee JE, Amos CI, Hyslop T, Jin J, Lin H, Wei Q. Component-wise gradient boosting and false discovery control in survival analysis with high-dimensional covariates. Bioinformatics. 2016 Jan 1;32(1):50–57.

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

January 1, 2016

Volume

32

Issue

1

Start / End Page

50 / 57

Location

England

Related Subject Headings

  • Time Factors
  • Survival Analysis
  • Skin Neoplasms
  • Risk Factors
  • Polymorphism, Single Nucleotide
  • Melanoma, Cutaneous Malignant
  • Melanoma
  • Humans
  • Computer Simulation
  • Bioinformatics