- Academic Search

M Lanctot, V Zambaldi, A Gruslys… - Advances in neural …, 2017 - proceedings.neurips.cc

There has been a resurgence of interest in multiagent reinforcement learning (MARL), due
partly to the recent success of deep neural networks. The simplest form of MARL is …

保存引用被引用数: 809 関連記事全 15 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multi-agent reinforcement learning in sequential social dilemmas

JZ Leibo, V Zambaldi, M Lanctot, J Marecki… - arxiv preprint arxiv …, 2017 - arxiv.org

Matrix games like Prisoner's Dilemma have guided research on social dilemmas for
decades. However, they necessarily treat the choice to cooperate or defect as an atomic …

保存引用被引用数: 960 関連記事全 14 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

OpenSpiel: A framework for reinforcement learning in games

M Lanctot, E Lockhart, JB Lespiau, V Zambaldi… - arxiv preprint arxiv …, 2019 - arxiv.org

OpenSpiel is a collection of environments and algorithms for research in general
reinforcement learning and search/planning in games. OpenSpiel supports n-player (single …

保存引用被引用数: 301 関連記事全 6 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Actor-critic policy optimization in partially observable multiagent environments

S Srinivasan, M Lanctot, V Zambaldi… - Advances in neural …, 2018 - proceedings.neurips.cc

Optimization of parameterized policies for reinforcement learning (RL) is an important and
challenging problem in artificial intelligence. Among the most common approaches are …

保存引用被引用数: 174 関連記事全 9 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Actor-critic fictitious play in simultaneous move multistage games

J Perolat, B Piot, O Pietquin - International Conference on …, 2018 - proceedings.mlr.press

Fictitious play is a game theoretic iterative procedure meant to learn an equilibrium in
normal form games. However, this algorithm requires that each player has full knowledge of …

保存引用被引用数: 82 関連記事全 7 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Multi-view decision processes: the helper-ai problem

C Dimitrakakis, DC Parkes… - Advances in neural …, 2017 - proceedings.neurips.cc

We consider a two-player sequential game in which agents have the same reward function
but may disagree on the transition probabilities of an underlying Markovian model of the …

保存引用被引用数: 32 関連記事全 7 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] mdpi.com

Research and implementation of intelligent decision based on a priori knowledge and DQN algorithms in wargame environment

Y Sun, B Yuan, T Zhang, B Tang, W Zheng, X Zhou - Electronics, 2020 - mdpi.com

The reinforcement learning problem of complex action control in a multi-player wargame has
been a hot research topic in recent years. In this paper, a game system based on turn-based …

保存引用被引用数: 21 関連記事全 5 バージョンキャッシュ

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

SC-PSRO: A Unified Strategy Learning Method for Normal-form Games

Y Hu, H Li, C Han, T Guo, M Li, B Li - arxiv preprint arxiv:2308.12520, 2023 - arxiv.org

Solving Nash equilibrium is the key challenge in normal-form games with large strategy
spaces, wherein open-ended learning framework provides an efficient approach. Previous …

保存引用被引用数: 1 関連記事全 2 バージョン HTMLバージョン

An algorithm for constructing and solving imperfect recall abstractions of large extensive-form games

J Čermák, B Bošansky, V Lisy - … of the 26th International Joint Conference …, 2017 - dl.acm.org

We solve large two-player zero-sum extensive-form games with perfect recall. We propose a
new algorithm based on fictitious play that significantly reduces memory requirements for …

保存引用被引用数: 21 関連記事

[Free GPT-4]
[DeepSeek]

[PDF] fransoliehoek.net

[PDF][PDF] Markov security games: Learning in spatial security problems

R Klima, K Tuyls, F Oliehoek - … inference and control of multi-agent …, 2016 - fransoliehoek.net

In this paper we present a preliminary investigation of modelling spatial aspects of security
games within the context of Markov games. Reinforcement learning is a powerful tool for …

保存引用被引用数: 22 関連記事全 3 バージョン HTMLバージョン

アラートを作成

引用

検索オプション

マイライブラリに保存しました

Algorithms for computing strategies in two-player simultaneous move games

A unified game-theoretic approach to multiagent reinforcement learning

Multi-agent reinforcement learning in sequential social dilemmas

OpenSpiel: A framework for reinforcement learning in games

Actor-critic policy optimization in partially observable multiagent environments

Actor-critic fictitious play in simultaneous move multistage games

Multi-view decision processes: the helper-ai problem

Research and implementation of intelligent decision based on a priori knowledge and DQN algorithms in wargame environment

SC-PSRO: A Unified Strategy Learning Method for Normal-form Games

An algorithm for constructing and solving imperfect recall abstractions of large extensive-form games

[PDF][PDF] Markov security games: Learning in spatial security problems