- Academic Search

A Kwiatkowski, E Alvarado, V Kalogeiton… - Computer Graphics …, 2022 - Wiley Online Library

Reinforcement Learning is an area of Machine Learning focused on how agents can be
trained to make sequential decisions, and achieve a particular goal within an arbitrary …

保存引用被引用数: 50 関連記事全 14 バージョン

[Free GPT-4]

[PDF] neurips.cc

The surprising effectiveness of ppo in cooperative multi-agent games

C Yu, A Velu, E Vinitsky, J Gao… - Advances in …, 2022 - proceedings.neurips.cc

Abstract Proximal Policy Optimization (PPO) is a ubiquitous on-policy reinforcement learning
algorithm but is significantly less utilized than off-policy learning algorithms in multi-agent …

保存引用被引用数: 1551 関連記事全 12 バージョン HTMLバージョン

[Free GPT-4]

[PDF] arxiv.org

A review of cooperation in multi-agent learning

Y Du, JZ Leibo, U Islam, R Willis, P Sunehag - arxiv preprint arxiv …, 2023 - arxiv.org

Cooperation in multi-agent learning (MAL) is a topic at the intersection of numerous
disciplines, including game theory, economics, social sciences, and evolutionary biology …

保存引用被引用数: 31 関連記事全 3 バージョン HTMLバージョン

[Free GPT-4]

[PDF] neurips.cc

Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning

B Ellis, J Cook, S Moalla… - Advances in …, 2024 - proceedings.neurips.cc

The availability of challenging benchmarks has played a key role in the recent progress of
machine learning. In cooperative multi-agent reinforcement learning, the StarCraft Multi …

保存引用被引用数: 92 関連記事全 6 バージョン HTMLバージョン

[書籍][B] Multi-agent reinforcement learning: Foundations and modern approaches

SV Albrecht, F Christianos, L Schäfer - 2024 - books.google.com

The first comprehensive introduction to Multi-Agent Reinforcement Learning (MARL),
covering MARL's models, solution concepts, algorithmic ideas, technical challenges, and …

保存引用被引用数: 135 関連記事図書館検索

[Free GPT-4]

[PDF] neurips.cc

Weighted qmix: Expanding monotonic value function factorisation for deep multi-agent reinforcement learning

T Rashid, G Farquhar, B Peng… - Advances in neural …, 2020 - proceedings.neurips.cc

QMIX is a popular $ Q $-learning algorithm for cooperative MARL in the centralised training
and decentralised execution paradigm. In order to enable easy decentralisation, QMIX …

保存引用被引用数: 401 関連記事全 9 バージョン HTMLバージョン

[Free GPT-4]

[PDF] arxiv.org

Rode: Learning roles to decompose multi-agent tasks

T Wang, T Gupta, A Mahajan, B Peng… - arxiv preprint arxiv …, 2020 - arxiv.org

Role-based learning holds the promise of achieving scalable multi-agent learning by
decomposing complex tasks using roles. However, it is largely unclear how to efficiently …

保存引用被引用数: 237 関連記事全 5 バージョン HTMLバージョン

[Free GPT-4]

[PDF] neurips.cc

Celebrating diversity in shared multi-agent reinforcement learning

C Li, T Wang, C Wu, Q Zhao… - Advances in Neural …, 2021 - proceedings.neurips.cc

Recently, deep multi-agent reinforcement learning (MARL) has shown the promise to solve
complex cooperative tasks. Its success is partly because of parameter sharing among …

保存引用被引用数: 170 関連記事全 9 バージョン HTMLバージョン

[Free GPT-4]

[PDF] neurips.cc

Facmac: Factored multi-agent centralised policy gradients

B Peng, T Rashid… - Advances in …, 2021 - proceedings.neurips.cc

Abstract We propose FACtored Multi-Agent Centralised policy gradients (FACMAC), a new
method for cooperative multi-agent reinforcement learning in both discrete and continuous …

保存引用被引用数: 258 関連記事全 9 バージョン HTMLバージョン

[Free GPT-4]

[PDF] mlr.press

Fop: Factorizing optimal joint policy of maximum-entropy multi-agent reinforcement learning

T Zhang, Y Li, C Wang, G **e… - … conference on machine …, 2021 - proceedings.mlr.press

Value decomposition recently injects vigorous vitality into multi-agent actor-critic methods.
However, existing decomposed actor-critic methods cannot guarantee the convergence of …

保存引用被引用数: 93 関連記事全 3 バージョン HTMLバージョン

引用

検索オプション

マイライブラリに保存しました

A survey on reinforcement learning methods in character animation

The surprising effectiveness of ppo in cooperative multi-agent games

A review of cooperation in multi-agent learning

Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning

[書籍][B] Multi-agent reinforcement learning: Foundations and modern approaches

Weighted qmix: Expanding monotonic value function factorisation for deep multi-agent reinforcement learning

Rode: Learning roles to decompose multi-agent tasks

Celebrating diversity in shared multi-agent reinforcement learning

Facmac: Factored multi-agent centralised policy gradients

Fop: Factorizing optimal joint policy of maximum-entropy multi-agent reinforcement learning