Skip to main content

A Coordination Optimization Framework for Multi-Agent Reinforcement Learning Based on Reward Redistribution and Experience Reutilization

Publication ,  Journal Article
Yang, B; Gao, L; Zhou, F; Yao, H; Fu, Y; Sun, Z; Tian, F; Ren, H
Published in: Electronics Switzerland
June 1, 2025

Cooperative multi-agent reinforcement learning (MARL) has emerged as a powerful paradigm for addressing complex real-world challenges, including autonomous robot control, strategic decision-making, and decentralized coordination in unmanned swarm systems. However, it still faces challenges in learning proper coordination among multiple agents. The lack of effective knowledge sharing and experience interaction mechanisms among agents has led to substantial performance decline, especially in terms of low sampling efficiency and slow convergence rates, ultimately constraining the practical applicability of MARL. To address these challenges, this paper proposes a novel framework termed Reward redistribution and Experience reutilization based Coordination Optimization (RECO). This innovative approach employs a hierarchical experience pool mechanism that enhances exploration through strategic reward redistribution and experience reutilization. The RECO framework incorporates a sophisticated evaluation mechanism that assesses the quality of historical sampling data from individual agents and optimizes reward distribution by maximizing mutual information across hierarchical experience trajectories. Extensive comparative analyses of computational efficiency and performance metrics across diverse environments reveal that the proposed method not only enhances training efficiency in multi-agent gaming scenarios but also significantly strengthens algorithmic robustness and stability in dynamic environments.

Duke Scholars

Published In

Electronics Switzerland

DOI

EISSN

2079-9292

Publication Date

June 1, 2025

Volume

14

Issue

12

Related Subject Headings

  • 4009 Electronics, sensors and digital hardware
  • 0906 Electrical and Electronic Engineering
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Yang, B., Gao, L., Zhou, F., Yao, H., Fu, Y., Sun, Z., … Ren, H. (2025). A Coordination Optimization Framework for Multi-Agent Reinforcement Learning Based on Reward Redistribution and Experience Reutilization. Electronics Switzerland, 14(12). https://doi.org/10.3390/electronics14122361
Yang, B., L. Gao, F. Zhou, H. Yao, Y. Fu, Z. Sun, F. Tian, and H. Ren. “A Coordination Optimization Framework for Multi-Agent Reinforcement Learning Based on Reward Redistribution and Experience Reutilization.” Electronics Switzerland 14, no. 12 (June 1, 2025). https://doi.org/10.3390/electronics14122361.
Yang B, Gao L, Zhou F, Yao H, Fu Y, Sun Z, et al. A Coordination Optimization Framework for Multi-Agent Reinforcement Learning Based on Reward Redistribution and Experience Reutilization. Electronics Switzerland. 2025 Jun 1;14(12).
Yang, B., et al. “A Coordination Optimization Framework for Multi-Agent Reinforcement Learning Based on Reward Redistribution and Experience Reutilization.” Electronics Switzerland, vol. 14, no. 12, June 2025. Scopus, doi:10.3390/electronics14122361.
Yang B, Gao L, Zhou F, Yao H, Fu Y, Sun Z, Tian F, Ren H. A Coordination Optimization Framework for Multi-Agent Reinforcement Learning Based on Reward Redistribution and Experience Reutilization. Electronics Switzerland. 2025 Jun 1;14(12).

Published In

Electronics Switzerland

DOI

EISSN

2079-9292

Publication Date

June 1, 2025

Volume

14

Issue

12

Related Subject Headings

  • 4009 Electronics, sensors and digital hardware
  • 0906 Electrical and Electronic Engineering