The role of information structures in game-theoretic multi-agent learning

T Li, Y Zhao, Q Zhu - Annual Reviews in Control, 2022 - Elsevier
Multi-agent learning (MAL) studies how agents learn to behave optimally and adaptively
from their experience when interacting with other agents in dynamic environments. The …

Decentralized Q-learning in zero-sum Markov games

M Sayin, K Zhang, D Leslie, T Basar… - Advances in Neural …, 2021 - proceedings.neurips.cc
We study multi-agent reinforcement learning (MARL) in infinite-horizon discounted zero-sum
Markov games. We focus on the practical but challenging setting of decentralized MARL …

Independent learning in stochastic games

A Ozdaglar, MO Sayin, K Zhang - International Congress of …, 2021 - ems.press
Reinforcement learning (RL) has recently achieved tremendous successes in many artificial
intelligence applications. Many of the forefront applications of RL involve multiple agents …

Last-iterate convergence of decentralized optimistic gradient descent/ascent in infinite-horizon competitive Markov games

CY Wei, CW Lee, M Zhang… - Conference on learning …, 2021 - proceedings.mlr.press
We study infinite-horizon discounted two-player zero-sum Markov games, and develop a
decentralized algorithm that provably converges to the set of Nash equilibria under self-play …

The confluence of networks, games, and learning a game-theoretic framework for multiagent decision making over networks

T Li, G Peng, Q Zhu, T Başar - IEEE Control Systems Magazine, 2022 - ieeexplore.ieee.org
Multiagent decision making over networks has recently attracted an exponentially growing
number of researchers from the systems and control community. The area has gained …

Learning Equilibria in Adversarial Team Markov Games: A Nonconvex-Hidden-Concave Min-Max Optimization Problem

F Kalogiannis, J Yan… - Advances in Neural …, 2025 - proceedings.neurips.cc
We study the problem of learning a Nash equilibrium (NE) in Markov games which is a
cornerstone in multi-agent reinforcement learning (MARL). In particular, we focus on infinite …

On the convergence of policy gradient methods to Nash equilibria in general stochastic games

A Giannou, K Lotidis, P Mertikopoulos… - Advances in …, 2022 - proceedings.neurips.cc
Learning in stochastic games is a notoriously difficult problem because, in addition to each
other's strategic decisions, the players must also contend with the fact that the game itself …

Zero-sum polymatrix markov games: Equilibrium collapse and efficient computation of nash equilibria

F Kalogiannis, I Panageas - Advances in Neural …, 2023 - proceedings.neurips.cc
The works of (Daskalakis et al., 2009, 2022; ** et al., 2022; Deng et al., 2023) indicate that
computing Nash equilibria in multi-player Markov games is a computationally hard task. This …

Multi-player zero-sum markov games with networked separable interactions

C Park, K Zhang, A Ozdaglar - Advances in Neural …, 2023 - proceedings.neurips.cc
We study a new class of Markov games,\textit {(multi-player) zero-sum Markov Games} with
{\it Networked separable interactions}(zero-sum NMGs), to model the local interaction …

A finite-sample analysis of payoff-based independent learning in zero-sum stochastic games

Z Chen, K Zhang, E Mazumdar… - Advances in …, 2023 - proceedings.neurips.cc
In this work, we study two-player zero-sum stochastic games and develop a variant of the
smoothed best-response learning dynamics that combines independent learning dynamics …