Playfusion: Skill acquisition via diffusion from language-annotated play

L Chen, S Bahl, D Pathak - Conference on Robot Learning, 2023 - proceedings.mlr.press
Learning from unstructured and uncurated data has become the dominant paradigm for
generative approaches in language or vision. Such unstructured and unguided behavior …

Graph deep learning: State of the art and challenges

S Georgousis, MP Kenning, X **e - IEEE Access, 2021 - ieeexplore.ieee.org
The last half-decade has seen a surge in deep learning research on irregular domains and
efforts to extend convolutional neural networks (CNNs) to work on irregularly structured data …

Relay policy learning: Solving long-horizon tasks via imitation and reinforcement learning

A Gupta, V Kumar, C Lynch, S Levine… - arxiv preprint arxiv …, 2019 - arxiv.org
We present relay policy learning, a method for imitation and reinforcement learning that can
solve multi-stage, long-horizon robotic tasks. This general and universally-applicable, two …

Parrot: Data-driven behavioral priors for reinforcement learning

A Singh, H Liu, G Zhou, A Yu, N Rhinehart… - arxiv preprint arxiv …, 2020 - arxiv.org
Reinforcement learning provides a general framework for flexible decision making and
control, but requires extensive data collection for each new task that an agent needs to …

Learning neuro-symbolic skills for bilevel planning

T Silver, A Athalye, JB Tenenbaum… - arxiv preprint arxiv …, 2022 - arxiv.org
Decision-making is challenging in robotics environments with continuous object-centric
states, continuous actions, long horizons, and sparse feedback. Hierarchical approaches …

Generalizing goal-conditioned reinforcement learning with variational causal reasoning

W Ding, H Lin, B Li, D Zhao - Advances in Neural …, 2022 - proceedings.neurips.cc
As a pivotal component to attaining generalizable solutions in human intelligence,
reasoning provides great potential for reinforcement learning (RL) agents' generalization …

Multi-stage cable routing through hierarchical imitation learning

J Luo, C Xu, X Geng, G Feng, K Fang… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
We study the problem of learning to perform multistage robotic manipulation tasks, with
applications to cable routing, where the robot must route a cable through a series of clips …

Hierarchical diffusion for offline decision making

W Li, X Wang, B **, H Zha - International Conference on …, 2023 - proceedings.mlr.press
Offline reinforcement learning typically introduces a hierarchical structure to solve the long-
horizon problem so as to address its thorny issue of variance accumulation. Problems of …

Recent advances in leveraging human guidance for sequential decision-making tasks

R Zhang, F Torabi, G Warnell, P Stone - Autonomous Agents and Multi …, 2021 - Springer
A longstanding goal of artificial intelligence is to create artificial agents capable of learning
to perform tasks that require sequential decision making. Importantly, while it is the artificial …

Dichotomy of control: Separating what you can control from what you cannot

M Yang, D Schuurmans, P Abbeel… - arxiv preprint arxiv …, 2022 - arxiv.org
Future-or return-conditioned supervised learning is an emerging paradigm for offline
reinforcement learning (RL), where the future outcome (ie, return) associated with an …