Skip to main content

Pay Attention Selectively and Comprehensively: Pyramid Gating Network for Human Pose Estimation without Pre-training

Publication ,  Conference
Jiang, C; Huang, K; Zhang, S; Wang, X; Xiao, J
Published in: MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia
October 12, 2020

Deep neural network with multi-scale feature fusion has achieved great success in human pose estimation. However, drawbacks still exist in these methods: 1) they consider multi-scale features equally, which may over-emphasize redundant features; 2) preferring deeper structures, they can learn features with the strong semantic representation, but tend to lose natural discriminative information; 3) to attain good performance, they rely heavily on pretraining, which is time-consuming, or even unavailable practically. To mitigate these problems, we propose a novel comprehensive recalibration model called Pyramid GAting Network (PGA-Net) that is capable of distillating, selecting, and fusing the discriminative and attention-aware features at different scales and different levels (i.e., both semantic and natural levels). Meanwhile, focusing on fusing features both selectively and comprehensively, PGA-Net can demonstrate remarkable stability and encouraging performance even without pre-training, making the model can be trained truly from scratch. We demonstrate the effectiveness of PGA-Net through validating on COCO and MPII benchmarks, attaining new state-of-the-art performance. https://github.com/ssr0512/PGA-Net

Duke Scholars

Published In

MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia

DOI

Publication Date

October 12, 2020

Start / End Page

2364 / 2371
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Jiang, C., Huang, K., Zhang, S., Wang, X., & Xiao, J. (2020). Pay Attention Selectively and Comprehensively: Pyramid Gating Network for Human Pose Estimation without Pre-training. In MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia (pp. 2364–2371). https://doi.org/10.1145/3394171.3414041
Jiang, C., K. Huang, S. Zhang, X. Wang, and J. Xiao. “Pay Attention Selectively and Comprehensively: Pyramid Gating Network for Human Pose Estimation without Pre-training.” In MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia, 2364–71, 2020. https://doi.org/10.1145/3394171.3414041.
Jiang C, Huang K, Zhang S, Wang X, Xiao J. Pay Attention Selectively and Comprehensively: Pyramid Gating Network for Human Pose Estimation without Pre-training. In: MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia. 2020. p. 2364–71.
Jiang, C., et al. “Pay Attention Selectively and Comprehensively: Pyramid Gating Network for Human Pose Estimation without Pre-training.” MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia, 2020, pp. 2364–71. Scopus, doi:10.1145/3394171.3414041.
Jiang C, Huang K, Zhang S, Wang X, Xiao J. Pay Attention Selectively and Comprehensively: Pyramid Gating Network for Human Pose Estimation without Pre-training. MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia. 2020. p. 2364–2371.

Published In

MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia

DOI

Publication Date

October 12, 2020

Start / End Page

2364 / 2371