Fictitious self-play in extensive-form games
Fictitious play is a popular game-theoretic model of learning in games. However, it has
received little attention in practical applications to large problems. This paper introduces two …
XDO: A double oracle algorithm for extensive-form games
Policy Space Response Oracles (PSRO) is a reinforcement learning (RL) algorithm
for two-player zero-sum games that has been empirically shown to find approximate Nash …
Faster game solving via predictive blackwell approachability: Connecting regret matching and mirror descent
Blackwell approachability is a framework for reasoning about repeated games with vector-
valued payoffs. We introduce predictive Blackwell approachability, where an estimate of the …
Solving large-scale pursuit-evasion games using pre-trained strategies
Pursuit-evasion games on graphs model the coordination of police forces chasing a fleeing
felon in real-world urban settings, using the standard framework of imperfect-information …
Faster algorithms for extensive-form game solving via improved smoothing functions
Sparse iterative methods, in particular first-order methods, are known to be among the most
effective in solving large-scale two-player zero-sum extensive-form games. The …
Co-optimization of multiple virtual power plants considering electricity-heat-carbon trading: A Stackelberg game strategy
J Cao, D Yang, P Dehghanian - International Journal of Electrical Power & …, 2023 - Elsevier
With the improvement of the electricity-heat-carbon trading mechanism, it has been a trend
for multiple virtual power plants (MVPP) to participate in the market competition. Firstly, in …
Optimizing honeypot strategies against dynamic lateral movement using partially observable stochastic games
Partially observable stochastic games (POSGs) are a general game-theoretic model for
capturing dynamic interactions where players have partial information. The existing …
Heuristic search value iteration for one-sided partially observable stochastic games
Security problems can be modeled as two-player partially observable stochastic games with
one-sided partial observability and infinite horizon (one-sided POSGs). We seek for optimal …
Better regularization for sequential decision spaces: Fast convergence rates for Nash, correlated, and team equilibria
We study the application of iterative first-order methods to the problem of computing
equilibria of large-scale two-player extensive-form games. First-order methods must typically …
Approximate solutions for attack graph games with imperfect information
We study the problem of network security hardening, in which a network administrator
decides what security measures to use to best improve the security of the network …