A survey on causal reinforcement learning
While reinforcement learning (RL) achieves tremendous success in sequential decision-
making problems of many domains, it still faces key challenges of data inefficiency and the …
making problems of many domains, it still faces key challenges of data inefficiency and the …
Causal reinforcement learning: A survey
Reinforcement learning is an essential paradigm for solving sequential decision problems
under uncertainty. Despite many remarkable achievements in recent decades, applying …
under uncertainty. Despite many remarkable achievements in recent decades, applying …
Causal confusion in imitation learning
Behavioral cloning reduces policy learning to supervised learning by training a
discriminative model to predict expert actions given observations. Such discriminative …
discriminative model to predict expert actions given observations. Such discriminative …
Structural causal bandits: Where to intervene?
We study the problem of identifying the best action in a sequential decision-making setting
when the reward distributions of the arms exhibit a non-trivial dependence structure, which …
when the reward distributions of the arms exhibit a non-trivial dependence structure, which …
Causal bandits with unknown graph structure
In causal bandit problems the action set consists of interventions on variables of a causal
graph. Several researchers have recently studied such bandit problems and pointed out …
graph. Several researchers have recently studied such bandit problems and pointed out …
Regret analysis of bandit problems with causal background knowledge
We study how to learn optimal interventions sequentially given causal information
represented as a causal graph along with associated conditional distributions. Causal …
represented as a causal graph along with associated conditional distributions. Causal …
Active learning for optimal intervention design in causal models
Sequential experimental design to discover interventions that achieve a desired outcome is
a key problem in various domains including science, engineering and public policy. When …
a key problem in various domains including science, engineering and public policy. When …
Provably efficient causal reinforcement learning with confounded observational data
Empowered by neural networks, deep reinforcement learning (DRL) achieves tremendous
empirical success. However, DRL requires a large dataset by interacting with the …
empirical success. However, DRL requires a large dataset by interacting with the …
Budgeted and non-budgeted causal bandits
Learning good interventions in a causal graph can be modelled as a stochastic multi-armed
bandit problem with side-information. First, we study this problem when interventions are …
bandit problem with side-information. First, we study this problem when interventions are …
Causal bandits for linear structural equation models
This paper studies the problem of designing an optimal sequence of interventions in a
causal graphical model to minimize cumulative regret with respect to the best intervention in …
causal graphical model to minimize cumulative regret with respect to the best intervention in …