A survey on reinforcement learning methods in character animation

A Kwiatkowski, E Alvarado, V Kalogeiton… - Computer Graphics …, 2022 - Wiley Online Library
Reinforcement Learning is an area of Machine Learning focused on how agents can be
trained to make sequential decisions, and achieve a particular goal within an arbitrary …

The surprising effectiveness of ppo in cooperative multi-agent games

C Yu, A Velu, E Vinitsky, J Gao… - Advances in …, 2022 - proceedings.neurips.cc
Abstract Proximal Policy Optimization (PPO) is a ubiquitous on-policy reinforcement learning
algorithm but is significantly less utilized than off-policy learning algorithms in multi-agent …

A review of cooperation in multi-agent learning

Y Du, JZ Leibo, U Islam, R Willis, P Sunehag - arxiv preprint arxiv …, 2023 - arxiv.org
Cooperation in multi-agent learning (MAL) is a topic at the intersection of numerous
disciplines, including game theory, economics, social sciences, and evolutionary biology …

Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning

B Ellis, J Cook, S Moalla… - Advances in …, 2024 - proceedings.neurips.cc
The availability of challenging benchmarks has played a key role in the recent progress of
machine learning. In cooperative multi-agent reinforcement learning, the StarCraft Multi …

[書籍][B] Multi-agent reinforcement learning: Foundations and modern approaches

SV Albrecht, F Christianos, L Schäfer - 2024 - books.google.com
The first comprehensive introduction to Multi-Agent Reinforcement Learning (MARL),
covering MARL's models, solution concepts, algorithmic ideas, technical challenges, and …

Weighted qmix: Expanding monotonic value function factorisation for deep multi-agent reinforcement learning

T Rashid, G Farquhar, B Peng… - Advances in neural …, 2020 - proceedings.neurips.cc
QMIX is a popular $ Q $-learning algorithm for cooperative MARL in the centralised training
and decentralised execution paradigm. In order to enable easy decentralisation, QMIX …

Rode: Learning roles to decompose multi-agent tasks

T Wang, T Gupta, A Mahajan, B Peng… - arxiv preprint arxiv …, 2020 - arxiv.org
Role-based learning holds the promise of achieving scalable multi-agent learning by
decomposing complex tasks using roles. However, it is largely unclear how to efficiently …

Celebrating diversity in shared multi-agent reinforcement learning

C Li, T Wang, C Wu, Q Zhao… - Advances in Neural …, 2021 - proceedings.neurips.cc
Recently, deep multi-agent reinforcement learning (MARL) has shown the promise to solve
complex cooperative tasks. Its success is partly because of parameter sharing among …

Facmac: Factored multi-agent centralised policy gradients

B Peng, T Rashid… - Advances in …, 2021 - proceedings.neurips.cc
Abstract We propose FACtored Multi-Agent Centralised policy gradients (FACMAC), a new
method for cooperative multi-agent reinforcement learning in both discrete and continuous …

Fop: Factorizing optimal joint policy of maximum-entropy multi-agent reinforcement learning

T Zhang, Y Li, C Wang, G **e… - … conference on machine …, 2021 - proceedings.mlr.press
Value decomposition recently injects vigorous vitality into multi-agent actor-critic methods.
However, existing decomposed actor-critic methods cannot guarantee the convergence of …