Multi-agent reinforcement learning: A selective overview of theories and algorithms
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …
has registered tremendous success in solving various sequential decision-making problems …
Deep multiagent reinforcement learning: Challenges and directions
This paper surveys the field of deep multiagent reinforcement learning (RL). The
combination of deep neural networks with RL has gained increased traction in recent years …
combination of deep neural networks with RL has gained increased traction in recent years …
A survey and critique of multiagent deep reinforcement learning
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …
led to a dramatic increase in the number of applications and methods. Recent works have …
A unified game-theoretic approach to multiagent reinforcement learning
There has been a resurgence of interest in multiagent reinforcement learning (MARL), due
partly to the recent success of deep neural networks. The simplest form of MARL is …
partly to the recent success of deep neural networks. The simplest form of MARL is …
[LIVRE][B] A concise introduction to decentralized POMDPs
FA Oliehoek, C Amato - 2016 - Springer
This book presents an overview of formal decision making methods for decentralized
cooperative systems. It is aimed at graduate students and researchers in the fields of …
cooperative systems. It is aimed at graduate students and researchers in the fields of …
[PDF][PDF] Is multiagent deep reinforcement learning the answer or the question? A brief survey
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …
led to a dramatic increase in the number of applications and methods. Recent works have …
[HTML][HTML] Risk-aware shielding of partially observable monte carlo planning policies
Abstract Partially Observable Monte Carlo Planning (POMCP) is a powerful online algorithm
that can generate approximate policies for large Partially Observable Markov Decision …
that can generate approximate policies for large Partially Observable Markov Decision …
Safe policy synthesis in multi-agent POMDPs via discrete-time barrier functions
A multi-agent partially observable Markov decision process (MPOMDP) is a modeling
paradigm used for high-level planning of heterogeneous autonomous agents subject to …
paradigm used for high-level planning of heterogeneous autonomous agents subject to …
Learning in POMDPs with Monte Carlo tree search
The POMDP is a powerful framework for reasoning under outcome and information
uncertainty, but constructing an accurate POMDP model is difficult. Bayes-Adaptive Partially …
uncertainty, but constructing an accurate POMDP model is difficult. Bayes-Adaptive Partially …
Cooperative traffic signal control using multi-step return and off-policy asynchronous advantage actor-critic graph algorithm
Intelligent traffic signal control helps to reduce traffic congestion and thus has been studied
for a few decades. Multi-intersection cooperative traffic signal control (CTSC), which is more …
for a few decades. Multi-intersection cooperative traffic signal control (CTSC), which is more …