Deep counterfactual regret minimization
Abstract Counterfactual Regret Minimization (CFR) is the leading algorithm for solving large
imperfect-information games. It converges to an equilibrium by iteratively traversing the …
imperfect-information games. It converges to an equilibrium by iteratively traversing the …
Last-iterate convergence of optimistic gradient method for monotone variational inequalities
Abstract The Past Extragradient (PEG)[Popov, 1980] method, also known as the Optimistic
Gradient method, has known a recent gain in interest in the optimization community with the …
Gradient method, has known a recent gain in interest in the optimization community with the …
Convergence of proximal point and extragradient-based methods beyond monotonicity: the case of negative comonotonicity
Algorithms for min-max optimization and variational inequalities are often studied under
monotonicity assumptions. Motivated by non-monotone machine learning applications, we …
monotonicity assumptions. Motivated by non-monotone machine learning applications, we …
Uncoupled Learning Dynamics with Swap Regret in Multiplayer Games
In this paper we establish efficient and\emph {uncoupled} learning dynamics so that, when
employed by all players in a general-sum multiplayer game, the\emph {swap regret} of each …
employed by all players in a general-sum multiplayer game, the\emph {swap regret} of each …
Suspicion-agent: Playing imperfect information games with theory of mind aware gpt-4
Unlike perfect information games, where all elements are known to every player, imperfect
information games emulate the real-world complexities of decision-making under uncertain …
information games emulate the real-world complexities of decision-making under uncertain …
A survey of opponent modeling in adversarial domains
Opponent modeling is the ability to use prior knowledge and observations in order to predict
the behavior of an opponent. This survey presents a comprehensive overview of existing …
the behavior of an opponent. This survey presents a comprehensive overview of existing …
Kernelized multiplicative weights for 0/1-polyhedral games: Bridging the gap between learning in extensive-form and normal-form games
While extensive-form games (EFGs) can be converted into normal-form games (NFGs),
doing so comes at the cost of an exponential blowup of the strategy space. So, progress on …
doing so comes at the cost of an exponential blowup of the strategy space. So, progress on …
Last-iterate convergence in extensive-form games
Regret-based algorithms are highly efficient at finding approximate Nash equilibria in
sequential games such as poker games. However, most regret-based algorithms, including …
sequential games such as poker games. However, most regret-based algorithms, including …
Faster game solving via predictive blackwell approachability: Connecting regret matching and mirror descent
Blackwell approachability is a framework for reasoning about repeated games with vector-
valued payoffs. We introduce predictive Blackwell approachability, where an estimate of the …
valued payoffs. We introduce predictive Blackwell approachability, where an estimate of the …
Block-coordinate methods and restarting for solving extensive-form games
Coordinate descent methods are popular in machine learning and optimization for their
simple sparse updates and excellent practical performance. In the context of large-scale …
simple sparse updates and excellent practical performance. In the context of large-scale …