- Academic Search

Last iterate convergence in no-regret learning: constrained min-max optimization for convex-conca...

F Orabona - arxiv preprint arxiv:1912.13213, 2019 - arxiv.org

In this monograph, I introduce the basic concepts of Online Learning through a modern view
of Online Convex Optimization. Here, online learning refers to the framework of regret …

Save Cite Cited by 418 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Near-optimal no-regret learning in general games

C Daskalakis, M Fishelson… - Advances in Neural …, 2021 - proceedings.neurips.cc

Abstract We show that Optimistic Hedge--a common variant of multiplicative-weights-
updates with recency bias--attains ${\rm poly}(\log T) $ regret in multi-player general-sum …

Save Cite Cited by 116 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Finite-time last-iterate convergence for learning in multi-player games

Y Cai, A Oikonomou, W Zheng - Advances in Neural …, 2022 - proceedings.neurips.cc

We study the question of last-iterate convergence rate of the extragradient algorithm by
Korpelevich [1976] and the optimistic gradient algorithm by Popov [1980] in multi-player …

Save Cite Cited by 48 Related articles All 8 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Tight last-iterate convergence rates for no-regret learning in multi-player games

N Golowich, S Pattathil… - Advances in neural …, 2020 - proceedings.neurips.cc

We study the question of obtaining last-iterate convergence rates for no-regret learning
algorithms in multi-player games. We show that the optimistic gradient (OG) algorithm with a …

Save Cite Cited by 101 Related articles All 7 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Fast policy extragradient methods for competitive games with entropy regularization

S Cen, Y Wei, Y Chi - Advances in Neural Information …, 2021 - proceedings.neurips.cc

This paper investigates the problem of computing the equilibrium of competitive games,
which is often modeled as a constrained saddle-point optimization problem with probability …

Save Cite Cited by 88 Related articles All 12 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Linear last-iterate convergence in constrained saddle-point optimization

CY Wei, CW Lee, M Zhang, H Luo - arxiv preprint arxiv:2006.09517, 2020 - arxiv.org

Optimistic Gradient Descent Ascent (OGDA) and Optimistic Multiplicative Weights Update
(OMWU) for saddle-point optimization have received growing attention due to their favorable …

Save Cite Cited by 128 Related articles All 4 versions Free GPT-4 View as HTML

Learning in games: a systematic review

RJ Qin, Y Yu - Science China Information Sciences, 2024 - Springer

Game theory studies the mathematical models for self-interested individuals. Nash
equilibrium is arguably the most central solution in game theory. While finding the Nash …

Save Cite Cited by 2 Related articles

[Free GPT-4]

[PDF] neurips.cc

Last-iterate convergent policy gradient primal-dual methods for constrained mdps

D Ding, CY Wei, K Zhang… - Advances in Neural …, 2024 - proceedings.neurips.cc

We study the problem of computing an optimal policy of an infinite-horizon discounted
constrained Markov decision process (constrained MDP). Despite the popularity of …

Save Cite Cited by 27 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Kernelized multiplicative weights for 0/1-polyhedral games: Bridging the gap between learning in extensive-form and normal-form games

G Farina, CW Lee, H Luo… - … Conference on Machine …, 2022 - proceedings.mlr.press

While extensive-form games (EFGs) can be converted into normal-form games (NFGs),
doing so comes at the cost of an exponential blowup of the strategy space. So, progress on …

Save Cite Cited by 32 Related articles All 4 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Last-iterate convergence in extensive-form games

CW Lee, C Kroer, H Luo - Advances in Neural Information …, 2021 - proceedings.neurips.cc

Regret-based algorithms are highly efficient at finding approximate Nash equilibria in
sequential games such as poker games. However, most regret-based algorithms, including …

Save Cite Cited by 48 Related articles All 6 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Last iterate convergence in no-regret learning: constrained min-max optimization for convex-conca...

A modern introduction to online learning

Near-optimal no-regret learning in general games

Finite-time last-iterate convergence for learning in multi-player games

Tight last-iterate convergence rates for no-regret learning in multi-player games

Fast policy extragradient methods for competitive games with entropy regularization

Linear last-iterate convergence in constrained saddle-point optimization

Learning in games: a systematic review

Last-iterate convergent policy gradient primal-dual methods for constrained mdps

Kernelized multiplicative weights for 0/1-polyhedral games: Bridging the gap between learning in extensive-form and normal-form games

Last-iterate convergence in extensive-form games