Skip to main content

Boosting Adversarial Robustness with CLAT: Criticality-Leveraged Adversarial Training

Publication ,  Conference
Gopal, B; Yang, H; Zhang, J; Horton, M; Chen, Y
Published in: Proceedings of Machine Learning Research
January 1, 2025

Adversarial training (AT) enhances neural network robustness. Typically, AT updates all trainable parameters, but can lead to overfitting and increased errors on clean data. Research suggests that fine-tuning specific parameters may be more effective; however, methods for identifying these essential parameters and establishing effective optimization objectives remain inadequately addressed. We present CLAT, an innovative adversarial fine-tuning algorithm that mitigates adversarial overfitting by integrating “criticality” into the training process. Instead of tuning the entire model, CLAT identifies and fine-tunes fewer parameters in robustness-critical layers—those predominantly learning non-robust features—while keeping the rest of the model fixed. Additionally, CLAT employs a dynamic layer selection process that adapts to changes in layer criticality during training. Empirical results demonstrate that CLAT can be seamlessly integrated with existing adversarial training methods, enhancing clean accuracy and adversarial robustness by over 2% compared to baseline approaches.

Duke Scholars

Published In

Proceedings of Machine Learning Research

EISSN

2640-3498

Publication Date

January 1, 2025

Volume

267

Start / End Page

20142 / 20161
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Gopal, B., Yang, H., Zhang, J., Horton, M., & Chen, Y. (2025). Boosting Adversarial Robustness with CLAT: Criticality-Leveraged Adversarial Training. In Proceedings of Machine Learning Research (Vol. 267, pp. 20142–20161).
Gopal, B., H. Yang, J. Zhang, M. Horton, and Y. Chen. “Boosting Adversarial Robustness with CLAT: Criticality-Leveraged Adversarial Training.” In Proceedings of Machine Learning Research, 267:20142–61, 2025.
Gopal B, Yang H, Zhang J, Horton M, Chen Y. Boosting Adversarial Robustness with CLAT: Criticality-Leveraged Adversarial Training. In: Proceedings of Machine Learning Research. 2025. p. 20142–61.
Gopal, B., et al. “Boosting Adversarial Robustness with CLAT: Criticality-Leveraged Adversarial Training.” Proceedings of Machine Learning Research, vol. 267, 2025, pp. 20142–61.
Gopal B, Yang H, Zhang J, Horton M, Chen Y. Boosting Adversarial Robustness with CLAT: Criticality-Leveraged Adversarial Training. Proceedings of Machine Learning Research. 2025. p. 20142–20161.

Published In

Proceedings of Machine Learning Research

EISSN

2640-3498

Publication Date

January 1, 2025

Volume

267

Start / End Page

20142 / 20161