Skip to main content

Blacklight: Scalable Defense for Neural Networks against Query-Based Black-Box Attacks

Publication ,  Conference
Li, H; Shan, S; Wenger, E; Zhang, J; Zheng, H; Zhao, BY
Published in: Proceedings of the 31st USENIX Security Symposium, Security 2022
January 1, 2022

Deep learning systems are known to be vulnerable to adversarial examples. In particular, query-based black-box attacks do not require knowledge of the deep learning model, but can compute adversarial examples over the network by submitting queries and inspecting returns. Recent work largely improves the efficiency of those attacks, demonstrating their practicality on today's ML-as-a-service platforms. We propose Blacklight, a new defense against query-based black-box adversarial attacks. Blacklight is driven by a fundamental insight: to compute adversarial examples, these attacks perform iterative optimization over the network, producing queries highly similar in the input space. Thus Blacklight detects query-based black-box attacks by detecting highly similar queries, using an efficient similarity engine operating on probabilistic content fingerprints. We evaluate Blacklight against eight state-of-the-art attacks, across a variety of models and image classification tasks. Blacklight identifies them all, often after only a handful of queries. By rejecting all detected queries, Blacklight prevents any attack from completing, even when persistent attackers continue to submit queries after banned accounts or rejected queries. Blacklight is also robust against several powerful countermeasures, including an optimal black-box attack that approximates white-box attacks in efficiency. Finally, we illustrate how Blacklight generalizes to other domains like text classification.

Duke Scholars

Published In

Proceedings of the 31st USENIX Security Symposium, Security 2022

Publication Date

January 1, 2022

Start / End Page

2117 / 2134
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Li, H., Shan, S., Wenger, E., Zhang, J., Zheng, H., & Zhao, B. Y. (2022). Blacklight: Scalable Defense for Neural Networks against Query-Based Black-Box Attacks. In Proceedings of the 31st USENIX Security Symposium, Security 2022 (pp. 2117–2134).
Li, H., S. Shan, E. Wenger, J. Zhang, H. Zheng, and B. Y. Zhao. “Blacklight: Scalable Defense for Neural Networks against Query-Based Black-Box Attacks.” In Proceedings of the 31st USENIX Security Symposium, Security 2022, 2117–34, 2022.
Li H, Shan S, Wenger E, Zhang J, Zheng H, Zhao BY. Blacklight: Scalable Defense for Neural Networks against Query-Based Black-Box Attacks. In: Proceedings of the 31st USENIX Security Symposium, Security 2022. 2022. p. 2117–34.
Li, H., et al. “Blacklight: Scalable Defense for Neural Networks against Query-Based Black-Box Attacks.” Proceedings of the 31st USENIX Security Symposium, Security 2022, 2022, pp. 2117–34.
Li H, Shan S, Wenger E, Zhang J, Zheng H, Zhao BY. Blacklight: Scalable Defense for Neural Networks against Query-Based Black-Box Attacks. Proceedings of the 31st USENIX Security Symposium, Security 2022. 2022. p. 2117–2134.

Published In

Proceedings of the 31st USENIX Security Symposium, Security 2022

Publication Date

January 1, 2022

Start / End Page

2117 / 2134