A survey on causal reinforcement learning

Y Zeng, R Cai, F Sun, L Huang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
While reinforcement learning (RL) achieves tremendous success in sequential decision-
making problems of many domains, it still faces key challenges of data inefficiency and the …

Interpretable reward redistribution in reinforcement learning: A causal approach

Y Zhang, Y Du, B Huang, Z Wang… - Advances in …, 2023 - proceedings.neurips.cc
A major challenge in reinforcement learning is to determine which state-action pairs are
responsible for future rewards that are delayed. Reward redistribution serves as a solution to …

Learning dynamic attribute-factored world models for efficient multi-object reinforcement learning

F Feng, S Magliacane - Advances in Neural Information …, 2023 - proceedings.neurips.cc
In many reinforcement learning tasks, the agent has to learn to interact with many objects of
different types and generalize to unseen combinations and numbers of objects. Often a task …

Ace: Off-policy actor-critic with causality-aware entropy regularization

T Ji, Y Liang, Y Zeng, Y Luo, G Xu, J Guo… - arxiv preprint arxiv …, 2024 - arxiv.org
The varying significance of distinct primitive behaviors during the policy learning process
has been overlooked by prior model-free RL algorithms. Leveraging this insight, we explore …

An adaptive deep rl method for non-stationary environments with piecewise stable context

X Chen, X Zhu, Y Zheng, P Zhang… - Advances in …, 2022 - proceedings.neurips.cc
One of the key challenges in deploying RL to real-world applications is to adapt to variations
of unknown environment contexts, such as changing terrains in robotic tasks and fluctuated …

Fast teammate adaptation in the presence of sudden policy change

Z Zhang, L Yuan, L Li, K Xue, C Jia… - Uncertainty in …, 2023 - proceedings.mlr.press
Cooperative multi-agent reinforcement learning (MARL), where agents coordinates with
teammate (s) for a shared goal, may sustain non-stationary caused by the policy change of …

Provably efficient causal model-based reinforcement learning for systematic generalization

M Mutti, R De Santi, E Rossi, JF Calderon… - Proceedings of the …, 2023 - ojs.aaai.org
In the sequential decision making setting, an agent aims to achieve systematic
generalization over a large, possibly infinite, set of environments. Such environments are …

Tempo adaptation in non-stationary reinforcement learning

H Lee, Y Ding, J Lee, M **… - Advances in Neural …, 2023 - proceedings.neurips.cc
We first raise and tackle a``time synchronization''issue between the agent and the
environment in non-stationary reinforcement learning (RL), a crucial factor hindering its real …

A robust test for the stationarity assumption in sequential decision making

J Wang, C Shi, Z Wu - International Conference on Machine …, 2023 - proceedings.mlr.press
Reinforcement learning (RL) is a powerful technique that allows an autonomous agent to
learn an optimal policy to maximize the expected return. The optimality of various RL …

Causal Temporal Representation Learning with Nonstationary Sparse Transition

X Song, Z Li, G Chen, Y Zheng, Y Fan… - Advances in …, 2025 - proceedings.neurips.cc
Abstract Causal Temporal Representation Learning (Ctrl) methods aim to identify the
temporal causal dynamics of complex nonstationary temporal sequences. Despite the …