Scholars@Duke publication: Mudjacking: Patching Backdoor Vulnerabilities in Foundation Models

Mudjacking: Patching Backdoor Vulnerabilities in Foundation Models

Publication , Conference

Liu, H; Reiter, MK; Gong, NZ

Published in: Proceedings of the 33rd Usenix Security Symposium

January 1, 2024

Foundation model has become the backbone of the AI ecosystem. In particular, a foundation model can be used as a general-purpose feature extractor to build various downstream classifiers. However, foundation models are vulnerable to backdoor attacks and a backdoored foundation model is a single-point-of-failure of the AI ecosystem, e.g., multiple downstream classifiers inherit the backdoor vulnerabilities simultaneously. In this work, we propose Mudjacking, the first method to patch foundation models to remove backdoors. Specifically, given a misclassified trigger-embedded input detected after a backdoored foundation model is deployed, Mudjacking adjusts the parameters of the foundation model to remove the backdoor. We formulate patching a foundation model as an optimization problem and propose a gradient descent based method to solve it. We evaluate Mudjacking on both vision and language foundation models, eleven benchmark datasets, five existing backdoor attacks, and thirteen adaptive backdoor attacks. Our results show that Mudjacking can remove backdoor from a foundation model while maintaining its utility.

Duke Scholars

Author Michael Reiter Computer Science

Published In

Proceedings of the 33rd Usenix Security Symposium

Publication Date

January 1, 2024

Start / End Page

2919 / 2936

Citation

APA

Chicago

ICMJE

MLA

NLM

Liu, H., Reiter, M. K., & Gong, N. Z. (2024). Mudjacking: Patching Backdoor Vulnerabilities in Foundation Models. In Proceedings of the 33rd Usenix Security Symposium (pp. 2919–2936).

Liu, H., M. K. Reiter, and N. Z. Gong. “Mudjacking: Patching Backdoor Vulnerabilities in Foundation Models.” In Proceedings of the 33rd Usenix Security Symposium, 2919–36, 2024.

Liu H, Reiter MK, Gong NZ. Mudjacking: Patching Backdoor Vulnerabilities in Foundation Models. In: Proceedings of the 33rd Usenix Security Symposium. 2024. p. 2919–36.

Liu, H., et al. “Mudjacking: Patching Backdoor Vulnerabilities in Foundation Models.” Proceedings of the 33rd Usenix Security Symposium, 2024, pp. 2919–36.

Liu H, Reiter MK, Gong NZ. Mudjacking: Patching Backdoor Vulnerabilities in Foundation Models. Proceedings of the 33rd Usenix Security Symposium. 2024. p. 2919–2936.

Published In

Proceedings of the 33rd Usenix Security Symposium

Publication Date

January 1, 2024

Start / End Page

2919 / 2936