Efficient and scalable reinforcement learning for large-scale network control

C Ma, A Li, Y Du, H Dong, Y Yang - Nature Machine Intelligence, 2024 - nature.com
The primary challenge in the development of large-scale artificial intelligence (AI) systems
lies in achieving scalable decision-making—extending the AI models while maintaining …

Proagent: Building proactive cooperative ai with large language models

C Zhang, K Yang, S Hu, Z Wang, G Li, Y Sun, C Zhang… - CoRR, 2023 - openreview.net
Building agents with adaptive behavior in cooperative tasks stands as a paramount goal in
the realm of multi-agent systems. Current approaches to develo** cooperative agents rely …

ProAgent: building proactive cooperative agents with large language models

C Zhang, K Yang, S Hu, Z Wang, G Li, Y Sun… - Proceedings of the …, 2024 - ojs.aaai.org
Building agents with adaptive behavior in cooperative tasks stands as a paramount goal in
the realm of multi-agent systems. Current approaches to develo** cooperative agents rely …

Taxai: A dynamic economic simulator and benchmark for multi-agent reinforcement learning

Q Mi, S **a, Y Song, H Zhang, S Zhu… - arxiv preprint arxiv …, 2023 - arxiv.org
Taxation and government spending are crucial tools for governments to promote economic
growth and maintain social equity. However, the difficulty in accurately predicting the …

Optimistic sequential multi-agent reinforcement learning with motivational communication

A Huang, Y Wang, X Zhou, H Zou, X Dong, X Che - Neural Networks, 2024 - Elsevier
Abstract Centralized Training with Decentralized Execution (CTDE) is a prevalent paradigm
in the field of fully cooperative Multi-Agent Reinforcement Learning (MARL). Existing …

Byzantine robust cooperative multi-agent reinforcement learning as a bayesian game

S Li, J Guo, J **u, R Xu, X Yu, J Wang, A Liu… - arxiv preprint arxiv …, 2023 - arxiv.org
In this study, we explore the robustness of cooperative multi-agent reinforcement learning (c-
MARL) against Byzantine failures, where any agent can enact arbitrary, worst-case actions …

An off-policy multi-agent stochastic policy gradient algorithm for cooperative continuous control

D Guo, L Tang, X Zhang, Y Liang - Neural Networks, 2024 - Elsevier
Multi-agent reinforcement learning (MARL) algorithms based on trust regions (TR) have
achieved significant success in numerous cooperative multi-agent tasks. These algorithms …

Hierarchical Consensus-Based Multi-Agent Reinforcement Learning for Multi-Robot Cooperation Tasks

P Feng, J Liang, S Wang, X Yu, X Ji… - 2024 IEEE/RSJ …, 2024 - ieeexplore.ieee.org
In multi-agent reinforcement learning (MARL), the Centralized Training with Decentralized
Execution (CTDE) framework is pivotal but struggles due to a gap: global state guidance in …

Enhancing Collaboration in Heterogeneous Multiagent Systems Through Communication Complementary Graph

K Peng, T Ma, L Jia, H Rong - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Heterogeneous multiagent systems are characterized by diverse task distributions, which
are prevalent in practical scenarios, such as distributed decision making and robotic …

FTR‐Bench: Benchmarking Deep Reinforcement Learning for Flipper‐Track Robot Control

H Zhang, J Ren, J **ao, H Pan, H Lu… - Journal of Field …, 2025 - Wiley Online Library
Tracked robots equipped with flippers and sensors are extensively employed in outdoor
search and rescue scenarios. However, achieving precise motion control on complex …