- Academic Search

A Mahajan, T Rashid, M Samvelyan… - Advances in neural …, 2019 - proceedings.neurips.cc

Centralised training with decentralised execution is an important setting for cooperative
deep multi-agent reinforcement learning due to communication constraints during execution …

Speichern Zitieren Zitiert von: 447 Ähnliche Artikel Alle 11 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Rode: Learning roles to decompose multi-agent tasks

T Wang, T Gupta, A Mahajan, B Peng… - arxiv preprint arxiv …, 2020 - arxiv.org

Role-based learning holds the promise of achieving scalable multi-agent learning by
decomposing complex tasks using roles. However, it is largely unclear how to efficiently …

Speichern Zitieren Zitiert von: 235 Ähnliche Artikel Alle 5 Versionen HTML-Version

[Free GPT-4]

[PDF] mlr.press

Uneven: Universal value exploration for multi-agent reinforcement learning

T Gupta, A Mahajan, B Peng… - International …, 2021 - proceedings.mlr.press

VDN and QMIX are two popular value-based algorithms for cooperative MARL that learn a
centralized action value function as a monotonic mixing of per-agent utilities. While this …

Speichern Zitieren Zitiert von: 55 Ähnliche Artikel Alle 11 Versionen HTML-Version

[Free GPT-4]

[PDF] mlr.press

Tesseract: Tensorised actors for multi-agent reinforcement learning

A Mahajan, M Samvelyan, L Mao… - International …, 2021 - proceedings.mlr.press

Reinforcement Learning in large action spaces is a challenging problem. This is especially
true for cooperative multi-agent reinforcement learning (MARL), which often requires …

Speichern Zitieren Zitiert von: 43 Ähnliche Artikel Alle 10 Versionen HTML-Version

[Free GPT-4]

[PDF] neurips.cc

Virel: A variational inference framework for reinforcement learning

M Fellows, A Mahajan, TGJ Rudner… - Advances in neural …, 2019 - proceedings.neurips.cc

Applying probabilistic models to reinforcement learning (RL) enables the uses of powerful
optimisation tools such as variational inference in RL. However, existing inference …

Speichern Zitieren Zitiert von: 63 Ähnliche Artikel Alle 15 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Invariant transform experience replay: Data augmentation for deep reinforcement learning

Y Lin, J Huang, M Zimmer, Y Guan… - IEEE Robotics and …, 2020 - ieeexplore.ieee.org

Deep Reinforcement Learning (RL) is a promising approach for adaptive robot control, but
its current application to robotics is currently hindered by high sample requirements. To …

Speichern Zitieren Zitiert von: 50 Ähnliche Artikel Alle 7 Versionen

[Free GPT-4]

[PDF] arxiv.org

Addressing imperfect symmetry: a novel symmetry-learning actor-critic extension

M Abreu, LP Reis, N Lau - arxiv preprint arxiv:2309.02711, 2023 - arxiv.org

Symmetry, a fundamental concept to understand our environment, often oversimplifies
reality from a mathematical perspective. Humans are a prime example, deviating from …

Speichern Zitieren Zitiert von: 6 Ähnliche Artikel Alle 2 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Knowledge-guided exploration in deep reinforcement learning

S Mazumder, B Liu, S Wang, Y Zhu, X Yin… - arxiv preprint arxiv …, 2022 - arxiv.org

This paper proposes a new method to drastically speed up deep reinforcement learning
(deep RL) training for problems that have the property of state-action permissibility (SAP) …

Speichern Zitieren Zitiert von: 3 Ähnliche Artikel Alle 2 Versionen HTML-Version

[Free GPT-4]

[PDF] kdd.org

[PDF][PDF] Action permissibility in deep reinforcement learning and application to autonomous driving

S Mazumder, B Liu, S Wang, Y Zhu, L Liu… - KDD'18 Deep Learning …, 2018 - kdd.org

This paper is concerned with deep reinforcement learning (deep RL) in continuous state and
action space. It proposes a new method that can drastically speed up RL training for …

Speichern Zitieren Zitiert von: 9 Ähnliche Artikel HTML-Version

[Free GPT-4]

[HTML] sciencedirect.com

[HTML][HTML] Addressing imperfect symmetry: A novel symmetry-learning actor-critic extension

M Abreu, LP Reis, N Lau - Neurocomputing, 2025 - Elsevier

Symmetry, a fundamental concept to understand our environment, often oversimplifies
reality from a mathematical perspective. Humans are a prime example, deviating from …

Speichern Zitieren Ähnliche Artikel Alle 2 Versionen

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Symmetry detection and exploitation for function approximation in deep rl

Maven: Multi-agent variational exploration

Rode: Learning roles to decompose multi-agent tasks

Uneven: Universal value exploration for multi-agent reinforcement learning

Tesseract: Tensorised actors for multi-agent reinforcement learning

Virel: A variational inference framework for reinforcement learning

Invariant transform experience replay: Data augmentation for deep reinforcement learning

Addressing imperfect symmetry: a novel symmetry-learning actor-critic extension

Knowledge-guided exploration in deep reinforcement learning

[PDF][PDF] Action permissibility in deep reinforcement learning and application to autonomous driving

[HTML][HTML] Addressing imperfect symmetry: A novel symmetry-learning actor-critic extension