Skip to main content
Journal cover image

AG-YOLO: Attention-guided network for real-time object detection

Publication ,  Journal Article
Zhu, H; Sun, L; Qin, W; Tian, F
Published in: Multimedia Tools and Applications
March 1, 2024

Existing neural network models directly add attention mechanisms to the network as a plug-and-play component to capture long-range dependencies and reconstruct feature maps. However, most methods do not fully tap the potential of attention in dealing with multi-scale problems. In this paper, an attention-guided YOLOv4 network (AG-YOLO) is proposed to address the multi-scale issue in object detection. We propose and apply multi-scale feature extraction to later stages of the backbone, which can not only enrich the feature hierarchy with low computational overhead, but also model the intra-scale and inter-scale correlation simultaneously to avoid missing key information. To reduce the redundant use of information flow, we propose a lightweight attention-guided feature pyramid network, which provides an efficient multi-level aggregation strategy based on multi-scale channel attention. In addition, a global context pathway is designed to reduce the dilution of high-level semantic information caused by information transmission. Compared with the baseline, AG-YOLO increased the mAP_0.5 by 1.67%, while the number of parameters and GFLOPs merely increased by 0.33M and 0.18, respectively. Meanwhile, the detection accuracy of small object categories has been improved.

Duke Scholars

Published In

Multimedia Tools and Applications

DOI

EISSN

1573-7721

ISSN

1380-7501

Publication Date

March 1, 2024

Volume

83

Issue

9

Start / End Page

28197 / 28213

Related Subject Headings

  • Software Engineering
  • Artificial Intelligence & Image Processing
  • 4606 Distributed computing and systems software
  • 4605 Data management and data science
  • 4603 Computer vision and multimedia computation
  • 4009 Electronics, sensors and digital hardware
  • 0806 Information Systems
  • 0805 Distributed Computing
  • 0803 Computer Software
  • 0801 Artificial Intelligence and Image Processing
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Zhu, H., Sun, L., Qin, W., & Tian, F. (2024). AG-YOLO: Attention-guided network for real-time object detection. Multimedia Tools and Applications, 83(9), 28197–28213. https://doi.org/10.1007/s11042-023-16568-3
Zhu, H., L. Sun, W. Qin, and F. Tian. “AG-YOLO: Attention-guided network for real-time object detection.” Multimedia Tools and Applications 83, no. 9 (March 1, 2024): 28197–213. https://doi.org/10.1007/s11042-023-16568-3.
Zhu H, Sun L, Qin W, Tian F. AG-YOLO: Attention-guided network for real-time object detection. Multimedia Tools and Applications. 2024 Mar 1;83(9):28197–213.
Zhu, H., et al. “AG-YOLO: Attention-guided network for real-time object detection.” Multimedia Tools and Applications, vol. 83, no. 9, Mar. 2024, pp. 28197–213. Scopus, doi:10.1007/s11042-023-16568-3.
Zhu H, Sun L, Qin W, Tian F. AG-YOLO: Attention-guided network for real-time object detection. Multimedia Tools and Applications. 2024 Mar 1;83(9):28197–28213.
Journal cover image

Published In

Multimedia Tools and Applications

DOI

EISSN

1573-7721

ISSN

1380-7501

Publication Date

March 1, 2024

Volume

83

Issue

9

Start / End Page

28197 / 28213

Related Subject Headings

  • Software Engineering
  • Artificial Intelligence & Image Processing
  • 4606 Distributed computing and systems software
  • 4605 Data management and data science
  • 4603 Computer vision and multimedia computation
  • 4009 Electronics, sensors and digital hardware
  • 0806 Information Systems
  • 0805 Distributed Computing
  • 0803 Computer Software
  • 0801 Artificial Intelligence and Image Processing