Sequential information design: Learning to persuade in the dark

M Bernasconi, M Castiglioni… - Advances in …, 2022 - proceedings.neurips.cc
We study a repeated information design problem faced by an informed sender who tries to
influence the behavior of a self-interested receiver. We consider settings where the receiver …

Safe learning in tree-form sequential decision making: Handling hard and soft constraints

M Bernasconi, F Cacciamani… - International …, 2022 - proceedings.mlr.press
We study decision making problems in which an agent sequentially interacts with a
stochastic environment defined by means of a tree structure. The agent repeatedly faces the …

Safe opponent-exploitation subgame refinement

M Liu, C Wu, Q Liu, Y **g, J Yang… - Advances in Neural …, 2022 - proceedings.neurips.cc
In zero-sum games, an NE strategy tends to be overly conservative confronted with
opponents of limited rationality, because it does not actively exploit their weaknesses. From …

Kdb-D2CFR: Solving Multiplayer imperfect-information games with knowledge distillation-based DeepCFR

H Li, Z Guo, Y Liu, X Wang, S Qi, J Zhang… - Knowledge-Based …, 2023 - Elsevier
Counterfactual regret minimization (CFR) is a popular method for finding approximate Nash
equilibrium in imperfect-information games (IIG). However, CFR based methods for the IIG …

Online learning in sequential Bayesian persuasion: Handling unknown priors

M Bernasconi, M Castiglioni, A Marchesi, N Gatti… - Artificial Intelligence, 2025 - Elsevier
We study a repeated information design problem faced by an informed sender who tries to
influence the behavior of a self-interested receiver, through the provision of payoff-relevant …

Modeling rationality: Toward better performance against unknown agents in sequential games

Z Ge, S Yang, P Tian, Z Chen… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Opponent modeling is necessary for autonomous agents to capture the intents of others
during strategic interactions. Most previous works assume that they can access enough …

A General Framework for Safe Decision Making: A Convex Duality Approach

M Bernasconi, F Cacciamani, N Gatti… - NeurIPS ML Safety …, 2022 - openreview.net
We study the problem of online interaction in general decision making problems, where the
objective is not only to find optimal strategies, but also to satisfy some safety guarantees …

A framework for safe decision making: A convex duality approach

M Bernasconi, F Cacciamani… - Intelligenza …, 2023 - content.iospress.com
We study the problem of online interaction in general decision making problems, where the
objective is not only to find optimal strategies, but also to satisfy certain safety guarantees …

Safe and Robust Subgame Exploitation in Imperfect Information Games

Z Ge, Z Xu, T Ding, L Meng, B An, W Li… - Forty-first International … - openreview.net
Opponent exploitation is an important task for players to exploit the weaknesses of others in
games. Existing approaches mainly focus on balancing between exploitation and …

Online learning with uncertain constraints: cumulative and replenishable violations

M Bernasconi de Luca - 2023 - politesi.polimi.it
Learning how to act optimally in complex environments constitutes a crucial milestone in the
field of artificial intelligence. In recent years, considerable attention has been directed …