Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning

B Ellis, J Cook, S Moalla… - Advances in …, 2024 - proceedings.neurips.cc
The availability of challenging benchmarks has played a key role in the recent progress of
machine learning. In cooperative multi-agent reinforcement learning, the StarCraft Multi …

Predictive representations: Building blocks of intelligence

W Carvalho, MS Tomov, W de Cothi, C Barry… - Neural …, 2024 - direct.mit.edu
Adaptive behavior often requires predicting future events. The theory of reinforcement
learning prescribes what kinds of predictive representations are useful and how to compute …

Pac: Assisted value factorization with counterfactual predictions in multi-agent reinforcement learning

H Zhou, T Lan, V Aggarwal - Advances in Neural …, 2022 - proceedings.neurips.cc
Multi-agent reinforcement learning (MARL) has witnessed significant progress with the
development of value function factorization methods. It allows optimizing a joint action-value …

Automatic grou** for efficient cooperative multi-agent reinforcement learning

Y Zang, J He, K Li, H Fu, Q Fu… - Advances in Neural …, 2023 - proceedings.neurips.cc
Grou** is ubiquitous in natural systems and is essential for promoting efficiency in team
coordination. This paper proposes a novel formulation of Group-oriented Multi-Agent …

A survey of progress on cooperative multi-agent reinforcement learning in open environment

L Yuan, Z Zhang, L Li, C Guan, Y Yu - arxiv preprint arxiv:2312.01058, 2023 - arxiv.org
Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …

Combining behaviors with the successor features keyboard

WC Carvalho, A Saraiva, A Filos… - Advances in …, 2024 - proceedings.neurips.cc
Abstract The Option Keyboard (OK) was recently proposed as a method for transferring
behavioral knowledge across tasks. OK transfers knowledge by adaptively combining …

Ldsa: Learning dynamic subtask assignment in cooperative multi-agent reinforcement learning

M Yang, J Zhao, X Hu, W Zhou… - Advances in Neural …, 2022 - proceedings.neurips.cc
Cooperative multi-agent reinforcement learning (MARL) has made prominent progress in
recent years. For training efficiency and scalability, most of the MARL algorithms make all …

Tesseract: Tensorised actors for multi-agent reinforcement learning

A Mahajan, M Samvelyan, L Mao… - International …, 2021 - proceedings.mlr.press
Reinforcement Learning in large action spaces is a challenging problem. This is especially
true for cooperative multi-agent reinforcement learning (MARL), which often requires …

Semantically aligned task decomposition in multi-agent reinforcement learning

W Li, D Qiao, B Wang, X Wang, B **, H Zha - arxiv preprint arxiv …, 2023 - arxiv.org
The difficulty of appropriately assigning credit is particularly heightened in cooperative
MARL with sparse reward, due to the concurrent time and structural scales involved …

Regularized softmax deep multi-agent q-learning

L Pan, T Rashid, B Peng, L Huang… - Advances in Neural …, 2021 - proceedings.neurips.cc
Tackling overestimation in $ Q $-learning is an important problem that has been extensively
studied in single-agent reinforcement learning, but has received comparatively little attention …