Skip to main content

Feature space perturbations yield more transferable adversarial examples

Publication ,  Conference
Inkawhich, N; Wen, W; Li, HH; Chen, Y
Published in: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
June 1, 2019

Many recent works have shown that deep learning models are vulnerable to quasi-imperceptible input perturbations, yet practitioners cannot fully explain this behavior. This work describes a transfer-based blackbox targeted adversarial attack of deep feature space representations that also provides insights into cross-model class representations of deep CNNs. The attack is explicitly designed for transferability and drives feature space representation of a source image at layer L towards the representation of a target image at L. The attack yields highly transferable targeted examples, which outperform competition winning methods by over 30% in targeted attack metrics. We also show the choice of L to generate examples from is important, transferability characteristics are blackbox model agnostic, and indicate that well trained deep models have similar highly-abstract representations.

Duke Scholars

Published In

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

DOI

ISSN

1063-6919

Publication Date

June 1, 2019

Volume

2019-June

Start / End Page

7059 / 7067
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Inkawhich, N., Wen, W., Li, H. H., & Chen, Y. (2019). Feature space perturbations yield more transferable adversarial examples. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Vol. 2019-June, pp. 7059–7067). https://doi.org/10.1109/CVPR.2019.00723
Inkawhich, N., W. Wen, H. H. Li, and Y. Chen. “Feature space perturbations yield more transferable adversarial examples.” In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2019-June:7059–67, 2019. https://doi.org/10.1109/CVPR.2019.00723.
Inkawhich N, Wen W, Li HH, Chen Y. Feature space perturbations yield more transferable adversarial examples. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2019. p. 7059–67.
Inkawhich, N., et al. “Feature space perturbations yield more transferable adversarial examples.” Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2019-June, 2019, pp. 7059–67. Scopus, doi:10.1109/CVPR.2019.00723.
Inkawhich N, Wen W, Li HH, Chen Y. Feature space perturbations yield more transferable adversarial examples. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2019. p. 7059–7067.

Published In

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

DOI

ISSN

1063-6919

Publication Date

June 1, 2019

Volume

2019-June

Start / End Page

7059 / 7067