A survey of progress on cooperative multi-agent reinforcement learning in open environment

L Yuan, Z Zhang, L Li, C Guan, Y Yu - ar**: A consensus-oriented strategy for multi-agent reinforcement learning
J Ruan, X Hao, D Li, H Mao - ECAI 2023, 2023 - ebooks.iospress.nl
Multi-agent systems require effective coordination between groups and individuals to
achieve common goals. However, current multi-agent reinforcement learning (MARL) …

Attention-guided contrastive role representations for multi-agent reinforcement learning

Z Hu, Z Zhang, H Li, C Chen, H Ding… - arxiv preprint arxiv …, 2023 - arxiv.org
Real-world multi-agent tasks usually involve dynamic team composition with the emergence
of roles, which should also be a key to efficient cooperation in multi-agent reinforcement …

Coordinating Multi-Agent Reinforcement Learning via Dual Collaborative Constraints

C Li, S Dong, S Yang, Y Hu, W Li, Y Gao - Neural Networks, 2025 - Elsevier
Many real-world multi-agent tasks exhibit a nearly decomposable structure, where
interactions among agents within the same interaction set are strong while interactions …

Self-motivated multi-agent exploration

S Zhang, J Cao, L Yuan, Y Yu, DC Zhan - arxiv preprint arxiv:2301.02083, 2023 - arxiv.org
In cooperative multi-agent reinforcement learning (CMARL), it is critical for agents to achieve
a balance between self-exploration and team collaboration. However, agents can hardly …

MaDE: Multi-Scale Decision Enhancement for Multi-Agent Reinforcement Learning

J Ruan, R **e, X **ong, S Xu… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
In the domain of multi-agent reinforcement learning (MARL), the limited information
availability, complex agent interactions, and individual capabilities among agents often pose …

Robust cooperative multi-agent reinforcement learning via multi-view message certification

L Yuan, T Jiang, L Li, F Chen, Z Zhang, Y Yu - Science China Information …, 2024 - Springer
Many multi-agent scenarios require message sharing among agents to promote
coordination, hastening the robustness of multi-agent communication when policies are …