Implicit learning dynamics in Stackelberg games: Equilibria characterization, convergence analysis, and empirical study

T Fiez, B Chasnov, L Ratliff - International Conference on …, 2020 - proceedings.mlr.press
Contemporary work on learning in continuous games has commonly overlooked the
hierarchical decision-making structure present in machine learning problems formulated as …

Global convergence to local minmax equilibrium in classes of nonconvex zero-sum games

T Fiez, L Ratliff, E Mazumdar… - Advances in Neural …, 2021 - proceedings.neurips.cc
We study gradient descent-ascent learning dynamics with timescale separation ($\tau$-GDA)
in unconstrained continuous action zero-sum games where the minimizing player …

[PDF] Local convergence analysis of gradient descent ascent with finite timescale separation

T Fiez, LJ Ratliff - Proceedings of the International Conference on …, 2021 - par.nsf.gov
We study the role that a finite timescale separation parameter τ has on gradient descent-
ascent in non-convex, non-concave zero-sum games where the learning rate of player 1 is …

Policy-gradient algorithms have no guarantees of convergence in linear quadratic games

E Mazumdar, LJ Ratliff, MI Jordan, SS Sastry - arXiv preprint arXiv …, 2019 - arxiv.org
We show by counterexample that policy-gradient algorithms have no guarantees of even
local convergence to Nash equilibria in continuous action and state space multi-agent …

Solving min-max optimization with hidden structure via gradient descent ascent

EV Vlatakis-Gkaragkounis, L Flokas… - Advances in Neural …, 2021 - proceedings.neurips.cc
Many recent AI architectures are inspired by zero-sum games; however, the behavior of their
dynamics is still not well understood. Inspired by this, we study standard gradient descent …

Gradient descent-ascent provably converges to strict local minmax equilibria with a finite timescale separation

T Fiez, L Ratliff - arXiv preprint arXiv:2009.14820, 2020 - arxiv.org
We study the role that a finite timescale separation parameter $\tau$ has on gradient
descent-ascent in two-player non-convex, non-concave zero-sum games where the learning …

Generalized natural gradient flows in hidden convex-concave games and GANs

A Mladenovic, I Sakos, G Gidel… - … Conference on Learning …, 2021 - openreview.net
Game-theoretic formulations in machine learning have recently risen in prominence,
whereby entire modeling paradigms are best captured as zero-sum games. Despite their …

Limiting behaviors of nonconvex-nonconcave minimax optimization via continuous-time systems

B Grimmer, H Lu, P Worah… - … on Algorithmic Learning …, 2022 - proceedings.mlr.press
Unlike nonconvex optimization, where gradient descent is guaranteed to converge to a local
optimizer, algorithms for nonconvex-nonconcave minimax optimization can have …

A note on large deviations for interacting particle dynamics for finding mixed equilibria in zero-sum games

V Nilsson, P Nyquist - arXiv preprint arXiv:2206.15177, 2022 - arxiv.org
Finding equilibrium points in continuous minimax games has become a key problem within
machine learning, in part due to its connection to the training of generative adversarial …

[BOOK] Beyond Worst-Case Analysis of Optimization in the Era of Machine Learning

EV Vlatakis-Gkaragkounis - 2022 - search.proquest.com
Worst-case analysis (WCA) has been the dominant tool for understanding the performance
of the lion's share of the algorithmic arsenal of theoretical computer science. While WCA has …