A survey on model-based reinforcement learning
Reinforcement learning (RL) interacts with the environment to solve sequential decision-
making problems via a trial-and-error approach. Errors are always undesirable in real-world …
making problems via a trial-and-error approach. Errors are always undesirable in real-world …
[BOOK][B] A concise introduction to decentralized POMDPs
FA Oliehoek, C Amato - 2016 - Springer
This book presents an overview of formal decision making methods for decentralized
cooperative systems. It is aimed at graduate students and researchers in the fields of …
cooperative systems. It is aimed at graduate students and researchers in the fields of …
A survey and critique of multiagent deep reinforcement learning
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …
led to a dramatic increase in the number of applications and methods. Recent works have …
Influence-based multi-agent exploration
Intrinsically motivated reinforcement learning aims to address the exploration challenge for
sparse-reward tasks. However, the study of exploration methods in transition-dependent …
sparse-reward tasks. However, the study of exploration methods in transition-dependent …
Multi-objective multi-agent decision making: a utility-based analysis and survey
The majority of multi-agent system implementations aim to optimise agents' policies with
respect to a single objective, despite the fact that many real-world problem domains are …
respect to a single objective, despite the fact that many real-world problem domains are …
[PDF][PDF] Is multiagent deep reinforcement learning the answer or the question? A brief survey
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …
led to a dramatic increase in the number of applications and methods. Recent works have …
Constrained multiagent Markov decision processes: A taxonomy of problems and algorithms
In domains such as electric vehicle charging, smart distribution grids and autonomous
warehouses, multiple agents share the same resources. When planning the use of these …
warehouses, multiple agents share the same resources. When planning the use of these …
Distributed heuristic forward search for multi-agent planning
This paper deals with the problem of classical planning for multiple cooperative agents who
have private information about their local state and capabilities they do not want to reveal …
have private information about their local state and capabilities they do not want to reveal …
Who needs to know? minimal knowledge for optimal coordination
To optimally coordinate with others in cooperative games, it is often crucial to have
information about one's collaborators: successful driving requires understanding which side …
information about one's collaborators: successful driving requires understanding which side …
Incremental clustering and expansion for faster optimal planning in Dec-POMDPs
This article presents the state-of-the-art in optimal solution methods for decentralized
partially observable Markov decision processes (Dec-POMDPs), which are general models …
partially observable Markov decision processes (Dec-POMDPs), which are general models …