Skip to main content
Journal cover image

Improving GBDT performance on imbalanced datasets: An empirical study of class-balanced loss functions

Publication ,  Journal Article
Luo, J; Yuan, Y; Xu, S
Published in: Neurocomputing
June 14, 2025

Class imbalance poses a persistent challenge in machine learning, particularly for tabular data classification tasks. While Gradient Boosting Decision Trees (GBDT) models are widely regarded as state-of-the-art for these tasks, their effectiveness diminishes in the presence of imbalanced datasets. This paper is the first to comprehensively explore the integration of class-balanced loss functions into three popular GBDT algorithms, addressing binary, multi-class, and multi-label classification. We present a novel benchmark, derived from extensive experiments across diverse datasets, to evaluate the performance gains from class-balanced losses in GBDT models. Our findings establish the efficacy of these loss functions in enhancing model performance under class imbalance, providing actionable insights for practitioners tackling real-world imbalanced data challenges. To bridge the gap between research and practice, we introduce an open-source Python package that simplifies the application of class-balanced loss functions within GBDT workflows, democratizing access to these advanced methodologies. The code is available at https://github.com/Luojiaqimath/ClassbalancedLoss4GBDT.

Duke Scholars

Published In

Neurocomputing

DOI

EISSN

1872-8286

ISSN

0925-2312

Publication Date

June 14, 2025

Volume

634

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 52 Psychology
  • 46 Information and computing sciences
  • 40 Engineering
  • 17 Psychology and Cognitive Sciences
  • 09 Engineering
  • 08 Information and Computing Sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Luo, J., Yuan, Y., & Xu, S. (2025). Improving GBDT performance on imbalanced datasets: An empirical study of class-balanced loss functions. Neurocomputing, 634. https://doi.org/10.1016/j.neucom.2025.129896
Luo, J., Y. Yuan, and S. Xu. “Improving GBDT performance on imbalanced datasets: An empirical study of class-balanced loss functions.” Neurocomputing 634 (June 14, 2025). https://doi.org/10.1016/j.neucom.2025.129896.
Luo, J., et al. “Improving GBDT performance on imbalanced datasets: An empirical study of class-balanced loss functions.” Neurocomputing, vol. 634, June 2025. Scopus, doi:10.1016/j.neucom.2025.129896.
Journal cover image

Published In

Neurocomputing

DOI

EISSN

1872-8286

ISSN

0925-2312

Publication Date

June 14, 2025

Volume

634

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 52 Psychology
  • 46 Information and computing sciences
  • 40 Engineering
  • 17 Psychology and Cognitive Sciences
  • 09 Engineering
  • 08 Information and Computing Sciences