Evolutionary dynamics of multi-agent learning: A survey
The interaction of multiple autonomous agents gives rise to highly dynamic and
nondeterministic environments, contributing to the complexity in applications such as …
nondeterministic environments, contributing to the complexity in applications such as …
A survey and critique of multiagent deep reinforcement learning
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …
led to a dramatic increase in the number of applications and methods. Recent works have …
A survey of learning in multiagent environments: Dealing with non-stationarity
The key challenge in multiagent learning is learning a best response to the behaviour of
other agents, which may be non-stationary: if the other agents adapt their strategy as well …
other agents, which may be non-stationary: if the other agents adapt their strategy as well …
From poincaré recurrence to convergence in imperfect information games: Finding equilibrium via regularization
In this paper we investigate the Follow the Regularized Leader dynamics in sequential
imperfect information games (IIG). We generalize existing results of Poincar {é} recurrence …
imperfect information games (IIG). We generalize existing results of Poincar {é} recurrence …
[PDF][PDF] Is multiagent deep reinforcement learning the answer or the question? A brief survey
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …
led to a dramatic increase in the number of applications and methods. Recent works have …
Exploration-exploitation in multi-agent learning: Catastrophe theory meets game theory
S Leonardos, G Piliouras - Artificial Intelligence, 2022 - Elsevier
Exploration-exploitation is a powerful and practical tool in multi-agent learning (MAL);
however, its effects are far from understood. To make progress in this direction, we study a …
however, its effects are far from understood. To make progress in this direction, we study a …
Exploration-exploitation in multi-agent competition: convergence with bounded rationality
S Leonardos, G Piliouras… - Advances in Neural …, 2021 - proceedings.neurips.cc
The interplay between exploration and exploitation in competitive multi-agent learning is still
far from being well understood. Motivated by this, we study smooth Q-learning, a prototypical …
far from being well understood. Motivated by this, we study smooth Q-learning, a prototypical …
Dynamics of Boltzmann learning in two-player two-action games
A Kianercy, A Galstyan - Physical Review E—Statistical, Nonlinear, and Soft …, 2012 - APS
We consider the dynamics of Q learning in two-player two-action games with a Boltzmann
exploration mechanism. For any nonzero exploration rate the dynamics is dissipative, which …
exploration mechanism. For any nonzero exploration rate the dynamics is dissipative, which …
Dynamical systems as a level of cognitive analysis of multi-agent learning: Algorithmic foundations of temporal-difference learning dynamics
W Barfuss - Neural Computing and Applications, 2022 - Springer
A dynamical systems perspective on multi-agent learning, based on the link between
evolutionary game theory and reinforcement learning, provides an improved, qualitative …
evolutionary game theory and reinforcement learning, provides an improved, qualitative …
[PDF][PDF] Beating Price of Anarchy and Gradient Descent without Regret in Potential Games
Arguably one of the thorniest problems in game theory is that of equilibrium selection.
Specifically, in the presence of multiple equilibria do self-interested learning dynamics …
Specifically, in the presence of multiple equilibria do self-interested learning dynamics …