- Academic Search

Sequential information design: Learning to persuade in the dark

M Bernasconi, M Castiglioni… - Advances in …, 2022 - proceedings.neurips.cc

We study a repeated information design problem faced by an informed sender who tries to
influence the behavior of a self-interested receiver. We consider settings where the receiver …

Cited by 29

Safe learning in tree-form sequential decision making: Handling hard and soft constraints

M Bernasconi, F Cacciamani… - International …, 2022 - proceedings.mlr.press

We study decision making problems in which an agent sequentially interacts with a
stochastic environment defined by means of a tree structure. The agent repeatedly faces the …

Cited by 14

Safe opponent-exploitation subgame refinement

M Liu, C Wu, Q Liu, Y Jing, J Yang… - Advances in Neural …, 2022 - proceedings.neurips.cc

In zero-sum games, an NE strategy tends to be overly conservative confronted with
opponents of limited rationality, because it does not actively exploit their weaknesses. From …

Cited by 8

Kdb-D2CFR: Solving Multiplayer imperfect-information games with knowledge distillation-based DeepCFR

H Li, Z Guo, Y Liu, X Wang, S Qi, J Zhang… - Knowledge-Based …, 2023 - Elsevier

Counterfactual regret minimization (CFR) is a popular method for finding approximate Nash
equilibrium in imperfect-information games (IIG). However, CFR based methods for the IIG …

Cited by 1

Online learning in sequential Bayesian persuasion: Handling unknown priors

M Bernasconi, M Castiglioni, A Marchesi, N Gatti… - Artificial Intelligence, 2025 - Elsevier

We study a repeated information design problem faced by an informed sender who tries to
influence the behavior of a self-interested receiver, through the provision of payoff-relevant …

Modeling rationality: Toward better performance against unknown agents in sequential games

Z Ge, S Yang, P Tian, Z Chen… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Opponent modeling is necessary for autonomous agents to capture the intents of others
during strategic interactions. Most previous works assume that they can access enough …

Cited by 2

A General Framework for Safe Decision Making: A Convex Duality Approach

M Bernasconi, F Cacciamani, N Gatti… - NeurIPS ML Safety …, 2022 - openreview.net

We study the problem of online interaction in general decision making problems, where the
objective is not only to find optimal strategies, but also to satisfy some safety guarantees …

Cited by 1

A framework for safe decision making: A convex duality approach

M Bernasconi, F Cacciamani… - Intelligenza …, 2023 - content.iospress.com

We study the problem of online interaction in general decision making problems, where the
objective is not only to find optimal strategies, but also to satisfy certain safety guarantees …

Safe and Robust Subgame Exploitation in Imperfect Information Games

Z Ge, Z Xu, T Ding, L Meng, B An, W Li… - Forty-first International … - openreview.net

Opponent exploitation is an important task for players to exploit the weaknesses of others in
games. Existing approaches mainly focus on balancing between exploitation and …

Cited by 1

Online learning with uncertain constraints: cumulative and replenishable violations

M Bernasconi de Luca - 2023 - politesi.polimi.it

Learning how to act optimally in complex environments constitutes a crucial milestone in the
field of artificial intelligence. In recent years, considerable attention has been directed …

Exploiting opponents under utility constraints in sequential games

Sequential information design: Learning to persuade in the dark
