Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A survey on causal reinforcement learning
Y Zeng, R Cai, F Sun, L Huang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
While reinforcement learning (RL) achieves tremendous success in sequential decision-
making problems of many domains, it still faces key challenges of data inefficiency and the …
making problems of many domains, it still faces key challenges of data inefficiency and the …
Interpretable reward redistribution in reinforcement learning: A causal approach
Y Zhang, Y Du, B Huang, Z Wang… - Advances in …, 2023 - proceedings.neurips.cc
A major challenge in reinforcement learning is to determine which state-action pairs are
responsible for future rewards that are delayed. Reward redistribution serves as a solution to …
responsible for future rewards that are delayed. Reward redistribution serves as a solution to …
Learning dynamic attribute-factored world models for efficient multi-object reinforcement learning
F Feng, S Magliacane - Advances in Neural Information …, 2023 - proceedings.neurips.cc
In many reinforcement learning tasks, the agent has to learn to interact with many objects of
different types and generalize to unseen combinations and numbers of objects. Often a task …
different types and generalize to unseen combinations and numbers of objects. Often a task …
Ace: Off-policy actor-critic with causality-aware entropy regularization
The varying significance of distinct primitive behaviors during the policy learning process
has been overlooked by prior model-free RL algorithms. Leveraging this insight, we explore …
has been overlooked by prior model-free RL algorithms. Leveraging this insight, we explore …
An adaptive deep rl method for non-stationary environments with piecewise stable context
One of the key challenges in deploying RL to real-world applications is to adapt to variations
of unknown environment contexts, such as changing terrains in robotic tasks and fluctuated …
of unknown environment contexts, such as changing terrains in robotic tasks and fluctuated …
Fast teammate adaptation in the presence of sudden policy change
Cooperative multi-agent reinforcement learning (MARL), where agents coordinates with
teammate (s) for a shared goal, may sustain non-stationary caused by the policy change of …
teammate (s) for a shared goal, may sustain non-stationary caused by the policy change of …
Provably efficient causal model-based reinforcement learning for systematic generalization
In the sequential decision making setting, an agent aims to achieve systematic
generalization over a large, possibly infinite, set of environments. Such environments are …
generalization over a large, possibly infinite, set of environments. Such environments are …
Tempo adaptation in non-stationary reinforcement learning
H Lee, Y Ding, J Lee, M **… - Advances in Neural …, 2023 - proceedings.neurips.cc
We first raise and tackle a``time synchronization''issue between the agent and the
environment in non-stationary reinforcement learning (RL), a crucial factor hindering its real …
environment in non-stationary reinforcement learning (RL), a crucial factor hindering its real …
A robust test for the stationarity assumption in sequential decision making
Reinforcement learning (RL) is a powerful technique that allows an autonomous agent to
learn an optimal policy to maximize the expected return. The optimality of various RL …
learn an optimal policy to maximize the expected return. The optimality of various RL …
Causal Temporal Representation Learning with Nonstationary Sparse Transition
Abstract Causal Temporal Representation Learning (Ctrl) methods aim to identify the
temporal causal dynamics of complex nonstationary temporal sequences. Despite the …
temporal causal dynamics of complex nonstationary temporal sequences. Despite the …