Guiding pretraining in reinforcement learning with large language models
Reinforcement learning algorithms typically struggle in the absence of a dense, well-shaped
reward function. Intrinsically motivated exploration methods address this limitation by …
reward function. Intrinsically motivated exploration methods address this limitation by …
A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
This article is about deep learning (DL) and deep reinforcement learning (DRL) works
applied to robotics. Both tools have been shown to be successful in delivering data-driven …
applied to robotics. Both tools have been shown to be successful in delivering data-driven …
Mastering visual continuous control: Improved data-augmented reinforcement learning
We present DrQ-v2, a model-free reinforcement learning (RL) algorithm for visual
continuous control. DrQ-v2 builds on DrQ, an off-policy actor-critic approach that uses data …
continuous control. DrQ-v2 builds on DrQ, an off-policy actor-critic approach that uses data …
An information-theoretic perspective on intrinsic motivation in reinforcement learning: A survey
The reinforcement learning (RL) research area is very active, with an important number of
new contributions, especially considering the emergent field of deep RL (DRL). However, a …
new contributions, especially considering the emergent field of deep RL (DRL). However, a …
Masked world models for visual control
Visual model-based reinforcement learning (RL) has the potential to enable sample-efficient
robot learning from visual observations. Yet the current approaches typically train a single …
robot learning from visual observations. Yet the current approaches typically train a single …
Reinforcement learning with action-free pre-training from videos
Recent unsupervised pre-training methods have shown to be effective on language and
vision domains by learning useful representations for multiple downstream tasks. In this …
vision domains by learning useful representations for multiple downstream tasks. In this …
Behavior from the void: Unsupervised active pre-training
We introduce a new unsupervised pre-training method for reinforcement learning called
APT, which stands for Active Pre-Training. APT learns behaviors and representations by …
APT, which stands for Active Pre-Training. APT learns behaviors and representations by …
Aps: Active pretraining with successor features
We introduce a new unsupervised pretraining objective for reinforcement learning. During
the unsupervised reward-free pretraining phase, the agent maximizes mutual information …
the unsupervised reward-free pretraining phase, the agent maximizes mutual information …
Pretraining representations for data-efficient reinforcement learning
Data efficiency is a key challenge for deep reinforcement learning. We address this problem
by using unlabeled data to pretrain an encoder which is then finetuned on a small amount of …
by using unlabeled data to pretrain an encoder which is then finetuned on a small amount of …
Urlb: Unsupervised reinforcement learning benchmark
Deep Reinforcement Learning (RL) has emerged as a powerful paradigm to solve a range
of complex yet specific control tasks. Yet training generalist agents that can quickly adapt to …
of complex yet specific control tasks. Yet training generalist agents that can quickly adapt to …