A unified game-theoretic approach to multiagent reinforcement learning

M Lanctot, V Zambaldi, A Gruslys… - Advances in neural …, 2017 - proceedings.neurips.cc
There has been a resurgence of interest in multiagent reinforcement learning (MARL), due
partly to the recent success of deep neural networks. The simplest form of MARL is …

Multi-agent reinforcement learning in sequential social dilemmas

JZ Leibo, V Zambaldi, M Lanctot, J Marecki… - arxiv preprint arxiv …, 2017 - arxiv.org
Matrix games like Prisoner's Dilemma have guided research on social dilemmas for
decades. However, they necessarily treat the choice to cooperate or defect as an atomic …

OpenSpiel: A framework for reinforcement learning in games

M Lanctot, E Lockhart, JB Lespiau, V Zambaldi… - arxiv preprint arxiv …, 2019 - arxiv.org
OpenSpiel is a collection of environments and algorithms for research in general
reinforcement learning and search/planning in games. OpenSpiel supports n-player (single …

Actor-critic policy optimization in partially observable multiagent environments

S Srinivasan, M Lanctot, V Zambaldi… - Advances in neural …, 2018 - proceedings.neurips.cc
Optimization of parameterized policies for reinforcement learning (RL) is an important and
challenging problem in artificial intelligence. Among the most common approaches are …

Actor-critic fictitious play in simultaneous move multistage games

J Perolat, B Piot, O Pietquin - International Conference on …, 2018 - proceedings.mlr.press
Fictitious play is a game theoretic iterative procedure meant to learn an equilibrium in
normal form games. However, this algorithm requires that each player has full knowledge of …

Multi-view decision processes: the helper-ai problem

C Dimitrakakis, DC Parkes… - Advances in neural …, 2017 - proceedings.neurips.cc
We consider a two-player sequential game in which agents have the same reward function
but may disagree on the transition probabilities of an underlying Markovian model of the …

Research and implementation of intelligent decision based on a priori knowledge and DQN algorithms in wargame environment

Y Sun, B Yuan, T Zhang, B Tang, W Zheng, X Zhou - Electronics, 2020 - mdpi.com
The reinforcement learning problem of complex action control in a multi-player wargame has
been a hot research topic in recent years. In this paper, a game system based on turn-based …

SC-PSRO: A Unified Strategy Learning Method for Normal-form Games

Y Hu, H Li, C Han, T Guo, M Li, B Li - arxiv preprint arxiv:2308.12520, 2023 - arxiv.org
Solving Nash equilibrium is the key challenge in normal-form games with large strategy
spaces, wherein open-ended learning framework provides an efficient approach. Previous …

An algorithm for constructing and solving imperfect recall abstractions of large extensive-form games

J Čermák, B Bošansky, V Lisy - … of the 26th International Joint Conference …, 2017 - dl.acm.org
We solve large two-player zero-sum extensive-form games with perfect recall. We propose a
new algorithm based on fictitious play that significantly reduces memory requirements for …

[PDF][PDF] Markov security games: Learning in spatial security problems

R Klima, K Tuyls, F Oliehoek - … inference and control of multi-agent …, 2016 - fransoliehoek.net
In this paper we present a preliminary investigation of modelling spatial aspects of security
games within the context of Markov games. Reinforcement learning is a powerful tool for …