How to train your robot with deep reinforcement learning: lessons we have learned
Deep reinforcement learning (RL) has emerged as a promising approach for autonomously
acquiring complex behaviors from low-level sensor observations. Although a large portion of …
acquiring complex behaviors from low-level sensor observations. Although a large portion of …
A review of robot learning for manipulation: Challenges, representations, and algorithms
A key challenge in intelligent robotics is creating robots that are capable of directly
interacting with the world around them to achieve their goals. The last decade has seen …
interacting with the world around them to achieve their goals. The last decade has seen …
A survey of meta-reinforcement learning
While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …
machine learning, it is held back from more widespread adoption by its often poor data …
Contrastive learning as goal-conditioned reinforcement learning
In reinforcement learning (RL), it is easier to solve a task if given a good representation.
While deep RL should automatically acquire such good representations, prior work often …
While deep RL should automatically acquire such good representations, prior work often …
Few-shot preference learning for human-in-the-loop rl
While reinforcement learning (RL) has become a more popular approach for robotics,
designing sufficiently informative reward functions for complex tasks has proven to be …
designing sufficiently informative reward functions for complex tasks has proven to be …
Visual foresight: Model-based deep reinforcement learning for vision-based robotic control
Deep reinforcement learning (RL) algorithms can learn complex robotic skills from raw
sensory inputs, but have yet to achieve the kind of broad generalization and applicability …
sensory inputs, but have yet to achieve the kind of broad generalization and applicability …
End-to-end robotic reinforcement learning without reward engineering
The combination of deep neural network models and reinforcement learning algorithms can
make it possible to learn policies for robotic behaviors that directly read in raw sensory …
make it possible to learn policies for robotic behaviors that directly read in raw sensory …
How to leverage unlabeled data in offline reinforcement learning
Offline reinforcement learning (RL) can learn control policies from static datasets but, like
standard RL methods, it requires reward annotations for every transition. In many cases …
standard RL methods, it requires reward annotations for every transition. In many cases …
Conservative data sharing for multi-task offline reinforcement learning
Offline reinforcement learning (RL) algorithms have shown promising results in domains
where abundant pre-collected data is available. However, prior methods focus on solving …
where abundant pre-collected data is available. However, prior methods focus on solving …
A survey of reinforcement learning from human feedback
Reinforcement learning from human feedback (RLHF) is a variant of reinforcement learning
(RL) that learns from human feedback instead of relying on an engineered reward function …
(RL) that learns from human feedback instead of relying on an engineered reward function …