A survey on model-based reinforcement learning

FM Luo, T Xu, H Lai, XH Chen, W Zhang… - Science China Information …, 2024 - Springer
Reinforcement learning (RL) interacts with the environment to solve sequential decision-
making problems via a trial-and-error approach. Errors are always undesirable in real-world …

[BOOK][B] A concise introduction to decentralized POMDPs

FA Oliehoek, C Amato - 2016 - Springer
This book presents an overview of formal decision making methods for decentralized
cooperative systems. It is aimed at graduate students and researchers in the fields of …

A survey and critique of multiagent deep reinforcement learning

P Hernandez-Leal, B Kartal, ME Taylor - Autonomous Agents and Multi …, 2019 - Springer
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

Influence-based multi-agent exploration

T Wang, J Wang, Y Wu, C Zhang - arxiv preprint arxiv:1910.05512, 2019 - arxiv.org
Intrinsically motivated reinforcement learning aims to address the exploration challenge for
sparse-reward tasks. However, the study of exploration methods in transition-dependent …

Multi-objective multi-agent decision making: a utility-based analysis and survey

R Rădulescu, P Mannion, DM Roijers… - Autonomous Agents and …, 2020 - Springer
The majority of multi-agent system implementations aim to optimise agents' policies with
respect to a single objective, despite the fact that many real-world problem domains are …

[PDF][PDF] Is multiagent deep reinforcement learning the answer or the question? A brief survey

P Hernandez-Leal, B Kartal, ME Taylor - learning, 2018 - researchgate.net
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

Constrained multiagent Markov decision processes: A taxonomy of problems and algorithms

F De Nijs, E Walraven, M De Weerdt, M Spaan - Journal of Artificial …, 2021 - jair.org
In domains such as electric vehicle charging, smart distribution grids and autonomous
warehouses, multiple agents share the same resources. When planning the use of these …

Distributed heuristic forward search for multi-agent planning

R Nissim, R Brafman - Journal of Artificial Intelligence Research, 2014 - jair.org
This paper deals with the problem of classical planning for multiple cooperative agents who
have private information about their local state and capabilities they do not want to reveal …

Who needs to know? minimal knowledge for optimal coordination

N Lauffer, A Shah, M Carroll… - International …, 2023 - proceedings.mlr.press
To optimally coordinate with others in cooperative games, it is often crucial to have
information about one's collaborators: successful driving requires understanding which side …

Incremental clustering and expansion for faster optimal planning in Dec-POMDPs

FA Oliehoek, MTJ Spaan, C Amato… - Journal of Artificial …, 2013 - jair.org
This article presents the state-of-the-art in optimal solution methods for decentralized
partially observable Markov decision processes (Dec-POMDPs), which are general models …