Multi-agent reinforcement learning: An overview

L Buşoniu, R Babuška, B De Schutter - Innovations in multi-agent systems …, 2010 - Springer
Multi-agent systems can be used to address problems in a variety of domains, including
robotics, distributed control, telecommunications, and economics. The complexity of many …

A comprehensive survey of multiagent reinforcement learning

L Busoniu, R Babuska… - IEEE Transactions on …, 2008 - ieeexplore.ieee.org
Multiagent systems are rapidly finding applications in a variety of domains, including
robotics, distributed control, telecommunications, and economics. The complexity of many …

RL/DRL meets vehicular task offloading using edge and vehicular cloudlet: A survey

J Liu, M Ahmed, MA Mirza, WU Khan… - IEEE Internet of …, 2022 - ieeexplore.ieee.org
The last two decades have seen a clear trend toward crafting intelligent vehicles based on
the significant advances in communication and computing paradigms, which provide a safer …

[PDF][PDF] Collaborative multiagent reinforcement learning by payoff propagation

JR Kok, N Vlassis - Journal of machine learning research, 2006 - jmlr.org
In this article we describe a set of scalable techniques for learning the behavior of a group of
agents in a collaborative multiagent setting. As a basis we use the framework of coordination …

-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through

S Kar, JMF Moura, HV Poor - IEEE Transactions on Signal …, 2013 - ieeexplore.ieee.org
The paper develops QD-learning, a distributed version of reinforcement Q-learning, for multi-
agent Markov decision processes (MDPs); the agents have no prior information on the …

Decentralized multi-robot cooperation with auctioned POMDPs

J Capitan, MTJ Spaan, L Merino… - … International Journal of …, 2013 - journals.sagepub.com
Planning under uncertainty faces a scalability problem when considering multi-robot teams,
as the information space scales exponentially with the number of robots. To address this …

Multi-agent reinforcement learning: A survey

L Busoniu, R Babuska… - 2006 9th international …, 2006 - ieeexplore.ieee.org
Multi-agent systems are rapidly finding applications in a variety of domains, including
robotics, distributed control, telecommunications, economics. Many tasks arising in these …

Using the max-plus algorithm for multiagent decision making in coordination graphs

JR Kok, N Vlassis - RoboCup 2005: Robot Soccer World Cup IX 9, 2006 - Springer
Coordination graphs offer a tractable framework for cooperative multiagent decision making
by decomposing the global payoff function into a sum of local terms. Each agent can in …

A decomposition approach to multi-vehicle cooperative control

MG Earl, R D'Andrea - Robotics and Autonomous Systems, 2007 - Elsevier
We use a decomposition approach to generate cooperative strategies for a class of multi-
vehicle control problems. By introducing a set of tasks to be completed by the team of …

Utile coordination: Learning interdependencies among cooperative agents

JR Kok, EJ Hoen, B Bakker, N Vlassis - EEE Symp. on Computational …, 2005 - orbilu.uni.lu
We describe Utile Coordination, an algorithm that allows a multiagent system to learn where
and how to coordinate. The method starts with uncoordinated learners and maintains …