Scholars@Duke publication: Feature space perturbations yield more transferable adversarial examples

Feature space perturbations yield more transferable adversarial examples

Publication , Conference

Inkawhich, N; Wen, W; Li, HH; Chen, Y

Published in: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

June 1, 2019

Many recent works have shown that deep learning models are vulnerable to quasi-imperceptible input perturbations, yet practitioners cannot fully explain this behavior. This work describes a transfer-based blackbox targeted adversarial attack of deep feature space representations that also provides insights into cross-model class representations of deep CNNs. The attack is explicitly designed for transferability and drives feature space representation of a source image at layer L towards the representation of a target image at L. The attack yields highly transferable targeted examples, which outperform competition winning methods by over 30% in targeted attack metrics. We also show the choice of L to generate examples from is important, transferability characteristics are blackbox model agnostic, and indicate that well trained deep models have similar highly-abstract representations.

Duke Scholars

Author Hai "Helen" Li Electrical and Computer Engineering

Author Yiran Chen Electrical and Computer Engineering

Published In

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

DOI

10.1109/CVPR.2019.00723

ISSN

1063-6919

Publication Date

June 1, 2019

Volume

2019-June

Start / End Page

7059 / 7067

Citation

APA

Chicago

ICMJE

MLA

NLM

Inkawhich, N., Wen, W., Li, H. H., & Chen, Y. (2019). Feature space perturbations yield more transferable adversarial examples. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Vol. 2019-June, pp. 7059–7067). https://doi.org/10.1109/CVPR.2019.00723

Inkawhich, N., W. Wen, H. H. Li, and Y. Chen. “Feature space perturbations yield more transferable adversarial examples.” In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2019-June:7059–67, 2019. https://doi.org/10.1109/CVPR.2019.00723.

Inkawhich N, Wen W, Li HH, Chen Y. Feature space perturbations yield more transferable adversarial examples. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2019. p. 7059–67.

Inkawhich, N., et al. “Feature space perturbations yield more transferable adversarial examples.” Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2019-June, 2019, pp. 7059–67. Scopus, doi:10.1109/CVPR.2019.00723.

Inkawhich N, Wen W, Li HH, Chen Y. Feature space perturbations yield more transferable adversarial examples. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2019. p. 7059–7067.

Published In

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

DOI

10.1109/CVPR.2019.00723

ISSN

1063-6919

Publication Date

June 1, 2019

Volume

2019-June

Start / End Page

7059 / 7067