Conservative data sharing for multi-task offline reinforcement learning
Offline reinforcement learning (RL) algorithms have shown promising results in domains
where abundant pre-collected data is available. However, prior methods focus on solving …
where abundant pre-collected data is available. However, prior methods focus on solving …
Usher: Unbiased sampling for hindsight experience replay
Dealing with sparse rewards is a long-standing challenge in reinforcement learning (RL).
Hindsight Experience Replay (HER) addresses this problem by reusing failed trajectories for …
Hindsight Experience Replay (HER) addresses this problem by reusing failed trajectories for …
Robotic Test Tube Rearrangement Using Combined Reinforcement Learning and Motion Planning
H Chen, W Wan, M Matsushita, T Kotaka… - arxiv preprint arxiv …, 2024 - arxiv.org
A combined task-level reinforcement learning and motion planning framework is proposed
in this paper to address a multi-class in-rack test tube rearrangement problem. At the task …
in this paper to address a multi-class in-rack test tube rearrangement problem. At the task …
Clustering-based Failed goal Aware Hindsight Experience Replay
T Kim, T Kang, H Jeong, D Har - PeerJ Computer Science, 2024 - peerj.com
In a multi-goal reinforcement learning environment, an agent learns a policy to perform tasks
with multiple goals from experiences gained through exploration. In environments with …
with multiple goals from experiences gained through exploration. In environments with …
Bias Resilient Multi-Step Off-Policy Goal-Conditioned Reinforcement Learning
L Wu, K Chen - arxiv preprint arxiv:2311.17565, 2023 - arxiv.org
In goal-conditioned reinforcement learning (GCRL), sparse rewards present significant
challenges, often obstructing efficient learning. Although multi-step GCRL can boost this …
challenges, often obstructing efficient learning. Although multi-step GCRL can boost this …
Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning
Offline reinforcement learning (RL) bears the promise to learn effective control policies from
static datasets but is thus far unable to learn from large databases of heterogeneous …
static datasets but is thus far unable to learn from large databases of heterogeneous …
Boosting Learning Efficiency in Goal-Conditioned Reinforcement Learning: Skill Augmentation and Multi-Step Learning
L Wu - 2024 - search.proquest.com
Goal-conditioned reinforcement learning (GCRL) has emerged as a promising avenue in
machine learning, offering the potential to master multiple tasks tailored to specific goals and …
machine learning, offering the potential to master multiple tasks tailored to specific goals and …
An unbiased method to train robots traveling in special conditions
T Zhou - AIP Conference Proceedings, 2024 - pubs.aip.org
It is a challenge to make robots move from one place to another on the shortest path and
avoid obstacles at the same time, especially when there are some special conditions occur …
avoid obstacles at the same time, especially when there are some special conditions occur …
[書籍][B] Reinforcement Learning from Static Datasets: Algorithms, Analysis, and Applications
A Kumar - 2023 - search.proquest.com
Reinforcement learning (RL) provides a formalism for learning-based control. By attempting
to learn behavioral policies that can optimize a user-specified reward function, RL methods …
to learn behavioral policies that can optimize a user-specified reward function, RL methods …
[書籍][B] Building Versatile Reinforcement Learning Agents with Offline Data
T Yu - 2022 - search.proquest.com
Recent advances in machine learning using deep neural networks have shown significant
successes in learning from large datasets. However, these successes concentrated on …
successes in learning from large datasets. However, these successes concentrated on …