How to train your robot with deep reinforcement learning: lessons we have learned
Deep reinforcement learning (RL) has emerged as a promising approach for autonomously
acquiring complex behaviors from low-level sensor observations. Although a large portion of …
acquiring complex behaviors from low-level sensor observations. Although a large portion of …
A review on deep learning techniques for video prediction
The ability to predict, anticipate and reason about future outcomes is a key component of
intelligent decision-making systems. In light of the success of deep learning in computer …
intelligent decision-making systems. In light of the success of deep learning in computer …
Daydreamer: World models for physical robot learning
To solve tasks in complex environments, robots need to learn from experience. Deep
reinforcement learning is a common approach to robot learning but requires a large amount …
reinforcement learning is a common approach to robot learning but requires a large amount …
Emergent tool use from multi-agent autocurricula
Through multi-agent competition, the simple objective of hide-and-seek, and standard
reinforcement learning algorithms at scale, we find that agents create a self-supervised …
reinforcement learning algorithms at scale, we find that agents create a self-supervised …
Mt-opt: Continuous multi-task robotic reinforcement learning at scale
General-purpose robotic systems must master a large repertoire of diverse skills to be useful
in a range of daily tasks. While reinforcement learning provides a powerful framework for …
in a range of daily tasks. While reinforcement learning provides a powerful framework for …
Robonet: Large-scale multi-robot learning
Robot learning has emerged as a promising tool for taming the complexity and diversity of
the real world. Methods based on high-capacity models, such as deep networks, hold the …
the real world. Methods based on high-capacity models, such as deep networks, hold the …
Greedy hierarchical variational autoencoders for large-scale video prediction
A video prediction model that generalizes to diverse scenes would enable intelligent agents
such as robots to perform a variety of tasks via planning with the model. However, while …
such as robots to perform a variety of tasks via planning with the model. However, while …
How to leverage unlabeled data in offline reinforcement learning
Offline reinforcement learning (RL) can learn control policies from static datasets but, like
standard RL methods, it requires reward annotations for every transition. In many cases …
standard RL methods, it requires reward annotations for every transition. In many cases …
Parrot: Data-driven behavioral priors for reinforcement learning
Reinforcement learning provides a general framework for flexible decision making and
control, but requires extensive data collection for each new task that an agent needs to …
control, but requires extensive data collection for each new task that an agent needs to …
Conservative data sharing for multi-task offline reinforcement learning
Offline reinforcement learning (RL) algorithms have shown promising results in domains
where abundant pre-collected data is available. However, prior methods focus on solving …
where abundant pre-collected data is available. However, prior methods focus on solving …