- Academic Search

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer

Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

Gem Citer Citeret af 1716 Relaterede artikler Alle 8 versioner

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arxiv preprint arxiv:2011.00583, 2020 - arxiv.org

Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

Gem Citer Citeret af 352 Relaterede artikler Alle 2 versioner Vis som HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

A theoretical analysis of deep Q-learning

J Fan, Z Wang, Y **e, Z Yang - Learning for dynamics and …, 2020 - proceedings.mlr.press

Despite the great empirical success of deep reinforcement learning, its theoretical
foundation is less well understood. In this work, we make the first attempt to theoretically …

Gem Citer Citeret af 868 Relaterede artikler Alle 9 versioner Vis som HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey and critique of multiagent deep reinforcement learning

P Hernandez-Leal, B Kartal, ME Taylor - Autonomous Agents and Multi …, 2019 - Springer

Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

Gem Citer Citeret af 704 Relaterede artikler Alle 8 versioner

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Learning with opponent-learning awareness

JN Foerster, RY Chen, M Al-Shedivat… - arxiv preprint arxiv …, 2017 - arxiv.org

Multi-agent settings are quickly gathering importance in machine learning. This includes a
plethora of recent work on deep multi-agent reinforcement learning, but also can be …

Gem Citer Citeret af 652 Relaterede artikler Alle 13 versioner Vis som HTML

[Free GPT-4]
[DeepSeek]

[PDF] sciencedirect.com

Autonomous agents modelling other agents: A comprehensive survey and open problems

SV Albrecht, P Stone - Artificial Intelligence, 2018 - Elsevier

Much research in artificial intelligence is concerned with the development of autonomous
agents that can interact effectively with other agents. An important aspect of such agents is …

Gem Citer Citeret af 622 Relaterede artikler Alle 10 versioner

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Stabilising experience replay for deep multi-agent reinforcement learning

J Foerster, N Nardelli, G Farquhar… - International …, 2017 - proceedings.mlr.press

Many real-world problems, such as network packet routing and urban traffic control, are
naturally modeled as multi-agent reinforcement learning (RL) problems. However, existing …

Gem Citer Citeret af 793 Relaterede artikler Alle 12 versioner Vis som HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Continuous adaptation via meta-learning in nonstationary and competitive environments

M Al-Shedivat, T Bansal, Y Burda, I Sutskever… - arxiv preprint arxiv …, 2017 - arxiv.org

Ability to continuously learn and adapt from limited experience in nonstationary
environments is an important milestone on the path towards general intelligence. In this …

Gem Citer Citeret af 431 Relaterede artikler Alle 8 versioner Vis som HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey of learning in multiagent environments: Dealing with non-stationarity

P Hernandez-Leal, M Kaisers, T Baarslag… - arxiv preprint arxiv …, 2017 - arxiv.org

The key challenge in multiagent learning is learning a best response to the behaviour of
other agents, which may be non-stationary: if the other agents adapt their strategy as well …

Gem Citer Citeret af 369 Relaterede artikler Alle 5 versioner Vis som HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Last-iterate convergence of decentralized optimistic gradient descent/ascent in infinite-horizon competitive markov games

CY Wei, CW Lee, M Zhang… - Conference on learning …, 2021 - proceedings.mlr.press

We study infinite-horizon discounted two-player zero-sum Markov games, and develop a
decentralized algorithm that provably converges to the set of Nash equilibria under self-play …

Gem Citer Citeret af 114 Relaterede artikler Alle 4 versioner Vis som HTML

Opret underretning

Citer

Avanceret søgning

Gemt i Min samling

AWESOME: A general multiagent learning algorithm that converges in self-play and learns a...

Multi-agent reinforcement learning: A selective overview of theories and algorithms

An overview of multi-agent reinforcement learning from game theoretical perspective

A theoretical analysis of deep Q-learning

A survey and critique of multiagent deep reinforcement learning

Learning with opponent-learning awareness

Autonomous agents modelling other agents: A comprehensive survey and open problems

Stabilising experience replay for deep multi-agent reinforcement learning

Continuous adaptation via meta-learning in nonstationary and competitive environments

A survey of learning in multiagent environments: Dealing with non-stationarity

Last-iterate convergence of decentralized optimistic gradient descent/ascent in infinite-horizon competitive markov games