Evolutionary dynamics of multi-agent learning: A survey

D Bloembergen, K Tuyls, D Hennes… - Journal of Artificial …, 2015 - jair.org
The interaction of multiple autonomous agents gives rise to highly dynamic and
nondeterministic environments, contributing to the complexity in applications such as …

A survey and critique of multiagent deep reinforcement learning

P Hernandez-Leal, B Kartal, ME Taylor - Autonomous Agents and Multi …, 2019 - Springer
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

A survey of learning in multiagent environments: Dealing with non-stationarity

P Hernandez-Leal, M Kaisers, T Baarslag… - arxiv preprint arxiv …, 2017 - arxiv.org
The key challenge in multiagent learning is learning a best response to the behaviour of
other agents, which may be non-stationary: if the other agents adapt their strategy as well …

From poincaré recurrence to convergence in imperfect information games: Finding equilibrium via regularization

J Perolat, R Munos, JB Lespiau… - International …, 2021 - proceedings.mlr.press
In this paper we investigate the Follow the Regularized Leader dynamics in sequential
imperfect information games (IIG). We generalize existing results of Poincar {é} recurrence …

[PDF][PDF] Is multiagent deep reinforcement learning the answer or the question? A brief survey

P Hernandez-Leal, B Kartal, ME Taylor - learning, 2018 - researchgate.net
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

Exploration-exploitation in multi-agent learning: Catastrophe theory meets game theory

S Leonardos, G Piliouras - Artificial Intelligence, 2022 - Elsevier
Exploration-exploitation is a powerful and practical tool in multi-agent learning (MAL);
however, its effects are far from understood. To make progress in this direction, we study a …

Exploration-exploitation in multi-agent competition: convergence with bounded rationality

S Leonardos, G Piliouras… - Advances in Neural …, 2021 - proceedings.neurips.cc
The interplay between exploration and exploitation in competitive multi-agent learning is still
far from being well understood. Motivated by this, we study smooth Q-learning, a prototypical …

Dynamics of Boltzmann learning in two-player two-action games

A Kianercy, A Galstyan - Physical Review E—Statistical, Nonlinear, and Soft …, 2012 - APS
We consider the dynamics of Q learning in two-player two-action games with a Boltzmann
exploration mechanism. For any nonzero exploration rate the dynamics is dissipative, which …

Dynamical systems as a level of cognitive analysis of multi-agent learning: Algorithmic foundations of temporal-difference learning dynamics

W Barfuss - Neural Computing and Applications, 2022 - Springer
A dynamical systems perspective on multi-agent learning, based on the link between
evolutionary game theory and reinforcement learning, provides an improved, qualitative …

[PDF][PDF] Beating Price of Anarchy and Gradient Descent without Regret in Potential Games

J Sakos, S Leonardos, S Stavroulakis… - The Twelfth …, 2024 - kclpure.kcl.ac.uk
Arguably one of the thorniest problems in game theory is that of equilibrium selection.
Specifically, in the presence of multiple equilibria do self-interested learning dynamics …