Skip to main content

Random Survival Forest Machine Learning for the Prediction of Cardiovascular Events Among Patients With a Measured Lipoprotein(a) Level: A Model Development Study.

Publication ,  Journal Article
Lusk, JB; O'Brien, EC; Hammill, BG; Li, F; Mac Grory, B; Patel, MR; Pagidipati, NJ; Shah, NP
Published in: Circ Genom Precis Med
February 2025

BACKGROUND: Established risk models may not be applicable to patients at higher cardiovascular risk with a measured Lp(a) (lipoprotein[a]) level, a causal risk factor for atherosclerotic cardiovascular disease. METHODS: This was a model development study. The data source was the Nashville Biosciences Lp(a) data set, which includes clinical data from the Vanderbilt University Health System. We included patients with an Lp(a) measured between 1989 and 2022 and who had at least 1 year of electronic health record data before measurement of an Lp(a) level. The end point of interest was time to first myocardial infarction, stroke/TIA, or coronary revascularization. A random survival forest model was derived and compared with a Cox proportional hazards model derived from traditional cardiovascular risk factors (ie, the variables used to estimate the Pooled Cohort Equations for the primary prevention population and the variables used to estimate the Second Manifestations of Arterial Disease and Thrombolysis in Myocardial Infarction Risk Score for Secondary Prevention scores for the secondary prevention population). Model discrimination was evaluated using Harrell's C-index. RESULTS: A total of 4369 patients were included in the study (49.5% were female, mean age was 51 [SD 18] years, and mean Lp(a) level was 33.6 [38.6] mg/dL, of whom 23.7% had a prior cardiovascular event). The random survival forest model outperformed the traditional risk factor models in the test set (c-index, 0.82 [random forest model] versus 0.69 [primary prevention model] versus 0.80 [secondary prevention model]). These results were similar when restricted to a primary prevention population and under various strategies to handle competing risk. A Cox proportional hazard model based on the top 25 variables from the random forest model had a c-index of 0.80. CONCLUSIONS: A random survival forest model outperformed a model using traditional risk factors for predicting cardiovascular events in patients with a measured Lp(a) level.

Duke Scholars

Published In

Circ Genom Precis Med

DOI

EISSN

2574-8300

Publication Date

February 2025

Volume

18

Issue

1

Start / End Page

e004629

Location

United States

Related Subject Headings

  • Risk Factors
  • Risk Assessment
  • Proportional Hazards Models
  • Middle Aged
  • Male
  • Machine Learning
  • Lipoprotein(a)
  • Humans
  • Female
  • Cardiovascular Diseases
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Lusk, J. B., O’Brien, E. C., Hammill, B. G., Li, F., Mac Grory, B., Patel, M. R., … Shah, N. P. (2025). Random Survival Forest Machine Learning for the Prediction of Cardiovascular Events Among Patients With a Measured Lipoprotein(a) Level: A Model Development Study. Circ Genom Precis Med, 18(1), e004629. https://doi.org/10.1161/CIRCGEN.124.004629
Lusk, Jay B., Emily C. O’Brien, Bradley G. Hammill, Fan Li, Brian Mac Grory, Manesh R. Patel, Neha J. Pagidipati, and Nishant P. Shah. “Random Survival Forest Machine Learning for the Prediction of Cardiovascular Events Among Patients With a Measured Lipoprotein(a) Level: A Model Development Study.Circ Genom Precis Med 18, no. 1 (February 2025): e004629. https://doi.org/10.1161/CIRCGEN.124.004629.
Lusk JB, O’Brien EC, Hammill BG, Li F, Mac Grory B, Patel MR, et al. Random Survival Forest Machine Learning for the Prediction of Cardiovascular Events Among Patients With a Measured Lipoprotein(a) Level: A Model Development Study. Circ Genom Precis Med. 2025 Feb;18(1):e004629.
Lusk, Jay B., et al. “Random Survival Forest Machine Learning for the Prediction of Cardiovascular Events Among Patients With a Measured Lipoprotein(a) Level: A Model Development Study.Circ Genom Precis Med, vol. 18, no. 1, Feb. 2025, p. e004629. Pubmed, doi:10.1161/CIRCGEN.124.004629.
Lusk JB, O’Brien EC, Hammill BG, Li F, Mac Grory B, Patel MR, Pagidipati NJ, Shah NP. Random Survival Forest Machine Learning for the Prediction of Cardiovascular Events Among Patients With a Measured Lipoprotein(a) Level: A Model Development Study. Circ Genom Precis Med. 2025 Feb;18(1):e004629.

Published In

Circ Genom Precis Med

DOI

EISSN

2574-8300

Publication Date

February 2025

Volume

18

Issue

1

Start / End Page

e004629

Location

United States

Related Subject Headings

  • Risk Factors
  • Risk Assessment
  • Proportional Hazards Models
  • Middle Aged
  • Male
  • Machine Learning
  • Lipoprotein(a)
  • Humans
  • Female
  • Cardiovascular Diseases