A survey on reinforcement learning methods in character animation
Reinforcement Learning is an area of Machine Learning focused on how agents can be
trained to make sequential decisions, and achieve a particular goal within an arbitrary …
trained to make sequential decisions, and achieve a particular goal within an arbitrary …
The surprising effectiveness of ppo in cooperative multi-agent games
Abstract Proximal Policy Optimization (PPO) is a ubiquitous on-policy reinforcement learning
algorithm but is significantly less utilized than off-policy learning algorithms in multi-agent …
algorithm but is significantly less utilized than off-policy learning algorithms in multi-agent …
A review of cooperation in multi-agent learning
Cooperation in multi-agent learning (MAL) is a topic at the intersection of numerous
disciplines, including game theory, economics, social sciences, and evolutionary biology …
disciplines, including game theory, economics, social sciences, and evolutionary biology …
Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning
The availability of challenging benchmarks has played a key role in the recent progress of
machine learning. In cooperative multi-agent reinforcement learning, the StarCraft Multi …
machine learning. In cooperative multi-agent reinforcement learning, the StarCraft Multi …
[書籍][B] Multi-agent reinforcement learning: Foundations and modern approaches
The first comprehensive introduction to Multi-Agent Reinforcement Learning (MARL),
covering MARL's models, solution concepts, algorithmic ideas, technical challenges, and …
covering MARL's models, solution concepts, algorithmic ideas, technical challenges, and …
Weighted qmix: Expanding monotonic value function factorisation for deep multi-agent reinforcement learning
QMIX is a popular $ Q $-learning algorithm for cooperative MARL in the centralised training
and decentralised execution paradigm. In order to enable easy decentralisation, QMIX …
and decentralised execution paradigm. In order to enable easy decentralisation, QMIX …
Rode: Learning roles to decompose multi-agent tasks
Role-based learning holds the promise of achieving scalable multi-agent learning by
decomposing complex tasks using roles. However, it is largely unclear how to efficiently …
decomposing complex tasks using roles. However, it is largely unclear how to efficiently …
Celebrating diversity in shared multi-agent reinforcement learning
Recently, deep multi-agent reinforcement learning (MARL) has shown the promise to solve
complex cooperative tasks. Its success is partly because of parameter sharing among …
complex cooperative tasks. Its success is partly because of parameter sharing among …
Facmac: Factored multi-agent centralised policy gradients
Abstract We propose FACtored Multi-Agent Centralised policy gradients (FACMAC), a new
method for cooperative multi-agent reinforcement learning in both discrete and continuous …
method for cooperative multi-agent reinforcement learning in both discrete and continuous …
Fop: Factorizing optimal joint policy of maximum-entropy multi-agent reinforcement learning
Value decomposition recently injects vigorous vitality into multi-agent actor-critic methods.
However, existing decomposed actor-critic methods cannot guarantee the convergence of …
However, existing decomposed actor-critic methods cannot guarantee the convergence of …