Multi-agent reinforcement learning: A selective overview of theories and algorithms

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arxiv preprint arxiv:2011.00583, 2020 - arxiv.org
Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

A theoretical analysis of deep Q-learning

J Fan, Z Wang, Y **e, Z Yang - Learning for dynamics and …, 2020 - proceedings.mlr.press
Despite the great empirical success of deep reinforcement learning, its theoretical
foundation is less well understood. In this work, we make the first attempt to theoretically …

A survey and critique of multiagent deep reinforcement learning

P Hernandez-Leal, B Kartal, ME Taylor - Autonomous Agents and Multi …, 2019 - Springer
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

Learning with opponent-learning awareness

JN Foerster, RY Chen, M Al-Shedivat… - arxiv preprint arxiv …, 2017 - arxiv.org
Multi-agent settings are quickly gathering importance in machine learning. This includes a
plethora of recent work on deep multi-agent reinforcement learning, but also can be …

Autonomous agents modelling other agents: A comprehensive survey and open problems

SV Albrecht, P Stone - Artificial Intelligence, 2018 - Elsevier
Much research in artificial intelligence is concerned with the development of autonomous
agents that can interact effectively with other agents. An important aspect of such agents is …

Stabilising experience replay for deep multi-agent reinforcement learning

J Foerster, N Nardelli, G Farquhar… - International …, 2017 - proceedings.mlr.press
Many real-world problems, such as network packet routing and urban traffic control, are
naturally modeled as multi-agent reinforcement learning (RL) problems. However, existing …

Continuous adaptation via meta-learning in nonstationary and competitive environments

M Al-Shedivat, T Bansal, Y Burda, I Sutskever… - arxiv preprint arxiv …, 2017 - arxiv.org
Ability to continuously learn and adapt from limited experience in nonstationary
environments is an important milestone on the path towards general intelligence. In this …

A survey of learning in multiagent environments: Dealing with non-stationarity

P Hernandez-Leal, M Kaisers, T Baarslag… - arxiv preprint arxiv …, 2017 - arxiv.org
The key challenge in multiagent learning is learning a best response to the behaviour of
other agents, which may be non-stationary: if the other agents adapt their strategy as well …

Last-iterate convergence of decentralized optimistic gradient descent/ascent in infinite-horizon competitive markov games

CY Wei, CW Lee, M Zhang… - Conference on learning …, 2021 - proceedings.mlr.press
We study infinite-horizon discounted two-player zero-sum Markov games, and develop a
decentralized algorithm that provably converges to the set of Nash equilibria under self-play …