A survey on causal reinforcement learning
While reinforcement learning (RL) achieves tremendous success in sequential decision-
making problems of many domains, it still faces key challenges of data inefficiency and the …
making problems of many domains, it still faces key challenges of data inefficiency and the …
Mocoda: Model-based counterfactual data augmentation
The number of states in a dynamic process is exponential in the number of objects, making
reinforcement learning (RL) difficult in complex, multi-object domains. For agents to scale to …
reinforcement learning (RL) difficult in complex, multi-object domains. For agents to scale to …
Proximal reinforcement learning: Efficient off-policy evaluation in partially observed markov decision processes
In applications of offline reinforcement learning to observational data, such as in healthcare
or education, a general concern is that observed actions might be affected by unobserved …
or education, a general concern is that observed actions might be affected by unobserved …
Offline imitation learning with variational counterfactual reasoning
In offline imitation learning (IL), an agent aims to learn an optimal expert behavior policy
without additional online environment interactions. However, in many real-world scenarios …
without additional online environment interactions. However, in many real-world scenarios …
[PDF][PDF] Causal inference q-network: Toward resilient reinforcement learning
Deep reinforcement learning (DRL) has demonstrated impressive performance in various
gaming simulators and real-world applications. In practice, however, a DRL agent may …
gaming simulators and real-world applications. In practice, however, a DRL agent may …
Offline imitation learning with variational counterfactual reasoning
In offline Imitation Learning (IL), an agent aims to learn an optimal expert behavior policy
without additional online environment interactions. However, in many real-world scenarios …
without additional online environment interactions. However, in many real-world scenarios …
Training a resilient q-network against observational interference
Deep reinforcement learning (DRL) has demonstrated impressive performance in various
gaming simulators and real-world applications. In practice, however, a DRL agent may …
gaming simulators and real-world applications. In practice, however, a DRL agent may …
Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning
Causal dynamics learning has recently emerged as a promising approach to enhancing
robustness in reinforcement learning (RL). Typically, the goal is to build a dynamics model …
robustness in reinforcement learning (RL). Typically, the goal is to build a dynamics model …
Learning under adversarial and interventional shifts
Machine learning models are often trained on data from one distribution and deployed on
others. So it becomes important to design models that are robust to distribution shifts. Most of …
others. So it becomes important to design models that are robust to distribution shifts. Most of …
Towards robust off-policy evaluation via human inputs
Off-policy Evaluation (OPE) methods are crucial tools for evaluating policies in high-stakes
domains such as healthcare, where direct deployment is often infeasible, unethical, or …
domains such as healthcare, where direct deployment is often infeasible, unethical, or …