Playfusion: Skill acquisition via diffusion from language-annotated play
Learning from unstructured and uncurated data has become the dominant paradigm for
generative approaches in language or vision. Such unstructured and unguided behavior …
generative approaches in language or vision. Such unstructured and unguided behavior …
Graph deep learning: State of the art and challenges
The last half-decade has seen a surge in deep learning research on irregular domains and
efforts to extend convolutional neural networks (CNNs) to work on irregularly structured data …
efforts to extend convolutional neural networks (CNNs) to work on irregularly structured data …
Relay policy learning: Solving long-horizon tasks via imitation and reinforcement learning
We present relay policy learning, a method for imitation and reinforcement learning that can
solve multi-stage, long-horizon robotic tasks. This general and universally-applicable, two …
solve multi-stage, long-horizon robotic tasks. This general and universally-applicable, two …
Parrot: Data-driven behavioral priors for reinforcement learning
Reinforcement learning provides a general framework for flexible decision making and
control, but requires extensive data collection for each new task that an agent needs to …
control, but requires extensive data collection for each new task that an agent needs to …
Learning neuro-symbolic skills for bilevel planning
Decision-making is challenging in robotics environments with continuous object-centric
states, continuous actions, long horizons, and sparse feedback. Hierarchical approaches …
states, continuous actions, long horizons, and sparse feedback. Hierarchical approaches …
Generalizing goal-conditioned reinforcement learning with variational causal reasoning
As a pivotal component to attaining generalizable solutions in human intelligence,
reasoning provides great potential for reinforcement learning (RL) agents' generalization …
reasoning provides great potential for reinforcement learning (RL) agents' generalization …
Multi-stage cable routing through hierarchical imitation learning
We study the problem of learning to perform multistage robotic manipulation tasks, with
applications to cable routing, where the robot must route a cable through a series of clips …
applications to cable routing, where the robot must route a cable through a series of clips …
Hierarchical diffusion for offline decision making
Offline reinforcement learning typically introduces a hierarchical structure to solve the long-
horizon problem so as to address its thorny issue of variance accumulation. Problems of …
horizon problem so as to address its thorny issue of variance accumulation. Problems of …
Recent advances in leveraging human guidance for sequential decision-making tasks
A longstanding goal of artificial intelligence is to create artificial agents capable of learning
to perform tasks that require sequential decision making. Importantly, while it is the artificial …
to perform tasks that require sequential decision making. Importantly, while it is the artificial …
Dichotomy of control: Separating what you can control from what you cannot
Future-or return-conditioned supervised learning is an emerging paradigm for offline
reinforcement learning (RL), where the future outcome (ie, return) associated with an …
reinforcement learning (RL), where the future outcome (ie, return) associated with an …