Skip to main content
Journal cover image

Multi-scale Attention Consistency for Multi-label Image Classification

Publication ,  Chapter
Xu, H; Jin, X; Wang, Q; Huang, K
January 1, 2020

Human has well demonstrated its cognitive consistency over image transformations such as flipping and scaling. In order to learn from human’s visual perception consistency, researchers find out that convolutional neural network’s capacity of discernment can be further elevated via forcing the network to concentrate on certain area in the picture in accordance with the human natural visual perception. Attention heatmap, as a supplementary tool to reveal the essential region that the network chooses to focus on, has been developed and widely adopted by CNNs. Based on this regime of visual consistency, we propose a novel end-to-end trainable CNN architecture with multi-scale attention consistency. Specifically, our model takes an original picture and its flipped counterpart as inputs, and then send them into a single standard Resnet with additional attention-enhanced modules to generate a semantically strong attention heatmap. We also compute the distance between multi-scale attention heatmaps of these two pictures and take it as an additional loss to help the network achieve better performance. Our network shows superiority on the multi-label classification task and attains compelling results on the WIDER Attribute Dataset.

Duke Scholars

DOI

ISBN

9783030638191

Publication Date

January 1, 2020

Volume

1332

Start / End Page

815 / 823
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Xu, H., Jin, X., Wang, Q., & Huang, K. (2020). Multi-scale Attention Consistency for Multi-label Image Classification (Vol. 1332, pp. 815–823). https://doi.org/10.1007/978-3-030-63820-7_93
Xu, H., X. Jin, Q. Wang, and K. Huang. “Multi-scale Attention Consistency for Multi-label Image Classification,” 1332:815–23, 2020. https://doi.org/10.1007/978-3-030-63820-7_93.
Xu H, Jin X, Wang Q, Huang K. Multi-scale Attention Consistency for Multi-label Image Classification. In 2020. p. 815–23.
Xu, H., et al. Multi-scale Attention Consistency for Multi-label Image Classification. Vol. 1332, 2020, pp. 815–23. Scopus, doi:10.1007/978-3-030-63820-7_93.
Xu H, Jin X, Wang Q, Huang K. Multi-scale Attention Consistency for Multi-label Image Classification. 2020. p. 815–823.
Journal cover image

DOI

ISBN

9783030638191

Publication Date

January 1, 2020

Volume

1332

Start / End Page

815 / 823