Skip to main content

Supercharging Imbalanced Data Learning With Energy-based Contrastive Representation Transfer.

Publication ,  Conference
Chen, J; Xiu, Z; Goldstein, BA; Henao, R; Carin, L; Tao, C
Published in: Adv Neural Inf Process Syst
December 2021

Dealing with severe class imbalance poses a major challenge for many real-world applications, especially when the accurate classification and generalization of minority classes are of primary interest. In computer vision and NLP, learning from datasets with long-tail behavior is a recurring theme, especially for naturally occurring labels. Existing solutions mostly appeal to sampling or weighting adjustments to alleviate the extreme imbalance, or impose inductive bias to prioritize generalizable associations. Here we take a novel perspective to promote sample efficiency and model generalization based on the invariance principles of causality. Our contribution posits a meta-distributional scenario, where the causal generating mechanism for label-conditional features is invariant across different labels. Such causal assumption enables efficient knowledge transfer from the dominant classes to their under-represented counterparts, even if their feature distributions show apparent disparities. This allows us to leverage a causal data augmentation procedure to enlarge the representation of minority classes. Our development is orthogonal to the existing imbalanced data learning techniques thus can be seamlessly integrated. The proposed approach is validated on an extensive set of synthetic and real-world tasks against state-of-the-art solutions.

Duke Scholars

Published In

Adv Neural Inf Process Syst

ISSN

1049-5258

Publication Date

December 2021

Volume

34

Start / End Page

21229 / 21243

Location

United States

Related Subject Headings

  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 1701 Psychology
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Chen, J., Xiu, Z., Goldstein, B. A., Henao, R., Carin, L., & Tao, C. (2021). Supercharging Imbalanced Data Learning With Energy-based Contrastive Representation Transfer. In Adv Neural Inf Process Syst (Vol. 34, pp. 21229–21243). United States.
Chen, Junya, Zidi Xiu, Benjamin A. Goldstein, Ricardo Henao, Lawrence Carin, and Chenyang Tao. “Supercharging Imbalanced Data Learning With Energy-based Contrastive Representation Transfer.” In Adv Neural Inf Process Syst, 34:21229–43, 2021.
Chen J, Xiu Z, Goldstein BA, Henao R, Carin L, Tao C. Supercharging Imbalanced Data Learning With Energy-based Contrastive Representation Transfer. In: Adv Neural Inf Process Syst. 2021. p. 21229–43.
Chen, Junya, et al. “Supercharging Imbalanced Data Learning With Energy-based Contrastive Representation Transfer.Adv Neural Inf Process Syst, vol. 34, 2021, pp. 21229–43.
Chen J, Xiu Z, Goldstein BA, Henao R, Carin L, Tao C. Supercharging Imbalanced Data Learning With Energy-based Contrastive Representation Transfer. Adv Neural Inf Process Syst. 2021. p. 21229–21243.

Published In

Adv Neural Inf Process Syst

ISSN

1049-5258

Publication Date

December 2021

Volume

34

Start / End Page

21229 / 21243

Location

United States

Related Subject Headings

  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 1701 Psychology