Академия Google

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer

Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

Сохранить Цитировать Цитируется: 1710 Похожие статьи Все версии статьи (7)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arxiv preprint arxiv:2011.00583, 2020 - arxiv.org

Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

Сохранить Цитировать Цитируется: 351 Похожие статьи Все версии статьи (2) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A modern introduction to online learning

F Orabona - arxiv preprint arxiv:1912.13213, 2019 - arxiv.org

In this monograph, I introduce the basic concepts of Online Learning through a modern view
of Online Convex Optimization. Here, online learning refers to the framework of regret …

Сохранить Цитировать Цитируется: 432 Похожие статьи Все версии статьи (3) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] archive.org

[КНИГА][B] Partially observed Markov decision processes

V Krishnamurthy - 2016 - books.google.com

Covering formulation, algorithms, and structural results, and linking theory to real-world
applications in controlled sensing (including social learning, adaptive radars and sequential …

Сохранить Цитировать Цитируется: 477 Похожие статьи Все версии статьи (5) Поиск в библиотеках

[Free GPT-4]
[DeepSeek]

[PDF] academia.edu

Potential games

D Monderer, LS Shapley - Games and economic behavior, 1996 - Elsevier

Potential Games Page 1 GAMES AND ECONOMIC BEHAVIOR 14, 124–143 (1996)
ARTICLE NO. 0044 Potential Games Dov Monderer ∗ Faculty of Industrial Engineering and …

Сохранить Цитировать Цитируется: 5437 Похожие статьи Все версии статьи (31)

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

[КНИГА][B] Prediction, learning, and games

N Cesa-Bianchi, G Lugosi - 2006 - books.google.com

This important text and reference for researchers and students in machine learning, game
theory, statistics and information theory offers a comprehensive treatment of the problem of …

Сохранить Цитировать Цитируется: 5199 Похожие статьи Все версии статьи (13) Поиск в библиотеках

[Free GPT-4]
[DeepSeek]

[PDF] psu.edu

The nonstochastic multiarmed bandit problem

P Auer, N Cesa-Bianchi, Y Freund, RE Schapire - SIAM journal on computing, 2002 - SIAM

In the multiarmed bandit problem, a gambler must decide which arm of K nonidentical slot
machines to play in a sequence of trials so as to maximize his reward. This classical …

Сохранить Цитировать Цитируется: 3255 Похожие статьи Все версии статьи (29)

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

[PDF][PDF] Online convex programming and generalized infinitesimal gradient ascent

M Zinkevich - Proceedings of the 20th international conference on …, 2003 - cdn.aaai.org

Convex programming involves a convex set F⊆ Rn and a convex cost function c: F→ R. The
goal of convex programming is to find a point in F which minimizes c. In online convex …

Сохранить Цитировать Цитируется: 3058 Похожие статьи Все версии статьи (15) Поиск в библиотеках В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] ssrn.com

Learning in repeated auctions with budgets: Regret minimization and equilibrium

SR Balseiro, Y Gur - Management Science, 2019 - pubsonline.informs.org

In online advertising markets, advertisers often purchase ad placements through bidding in
repeated auctions based on realized viewer information. We study how budget-constrained …

Сохранить Цитировать Цитируется: 251 Похожие статьи Все версии статьи (9)

[КНИГА][B] Robustness

LP Hansen, TJ Sargent - 2008 - degruyter.com

The standard theory of decision making under uncertainty advises the decision maker to
form a statistical model linking outcomes to decisions and then to choose the optimal …

Сохранить Цитировать Цитируется: 1824 Похожие статьи Все версии статьи (7) Поиск в библиотеках

Создать оповещение

Цитировать

Расширенный поиск

Сохранено в вашей библиотеке

Consistency and cautious fictitious play

Multi-agent reinforcement learning: A selective overview of theories and algorithms

An overview of multi-agent reinforcement learning from game theoretical perspective

A modern introduction to online learning

[КНИГА][B] Partially observed Markov decision processes

Potential games

[КНИГА][B] Prediction, learning, and games

The nonstochastic multiarmed bandit problem

[PDF][PDF] Online convex programming and generalized infinitesimal gradient ascent

Learning in repeated auctions with budgets: Regret minimization and equilibrium

[КНИГА][B] Robustness