A survey on causal reinforcement learning

Y Zeng, R Cai, F Sun, L Huang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
While reinforcement learning (RL) achieves tremendous success in sequential decision-
making problems of many domains, it still faces key challenges of data inefficiency and the …

Toward causal representation learning

B Schölkopf, F Locatello, S Bauer, NR Ke… - Proceedings of the …, 2021 - ieeexplore.ieee.org
The two fields of machine learning and graphical causality arose and are developed
separately. However, there is, now, cross-pollination and increasing interest in both fields to …

Causality for machine learning

B Schölkopf - Probabilistic and causal inference: The works of Judea …, 2022 - dl.acm.org
The machine learning community's interest in causality has significantly increased in recent
years. My understanding of causality has been shaped by Judea Pearl and a number of …

Causal reinforcement learning: A survey

Z Deng, J Jiang, G Long, C Zhang - arxiv preprint arxiv:2307.01452, 2023 - arxiv.org
Reinforcement learning is an essential paradigm for solving sequential decision problems
under uncertainty. Despite many remarkable achievements in recent decades, applying …

Independent mechanism analysis, a new concept?

L Gresele, J Von Kügelgen, V Stimper… - Advances in neural …, 2021 - proceedings.neurips.cc
Independent component analysis provides a principled framework for unsupervised
representation learning, with solid theory on the identifiability of the latent code that …

Causal influence detection for improving efficiency in reinforcement learning

M Seitzer, B Schölkopf… - Advances in Neural …, 2021 - proceedings.neurips.cc
Many reinforcement learning (RL) environments consist of independent entities that interact
sparsely. In such environments, RL agents have only limited influence over other entities in …

[HTML][HTML] A reusable benchmark of brain-age prediction from M/EEG resting-state signals

DA Engemann, A Mellot, R Höchenberger, H Banville… - Neuroimage, 2022 - Elsevier
Population-level modeling can define quantitative measures of individual aging by applying
machine learning to large volumes of brain images. These measures of brain age, obtained …

Counterfactual data augmentation using locally factored dynamics

S Pitis, E Creager, A Garg - Advances in Neural Information …, 2020 - proceedings.neurips.cc
Many dynamic processes, including common scenarios in robotic control and reinforcement
learning (RL), involve a set of interacting subprocesses. Though the subprocesses are not …

Off-policy evaluation in partially observable environments

G Tennenholtz, U Shalit, S Mannor - … of the AAAI Conference on Artificial …, 2020 - ojs.aaai.org
This work studies the problem of batch off-policy evaluation for Reinforcement Learning in
partially observable environments. Off-policy evaluation under partial observability is …

Nonlinear invariant risk minimization: A causal approach

C Lu, Y Wu, JM Hernández-Lobato… - arxiv preprint arxiv …, 2021 - arxiv.org
Due to spurious correlations, machine learning systems often fail to generalize to
environments whose distributions differ from the ones used at training time. Prior work …