Exploration in deep reinforcement learning: A survey
This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …
Towards continual reinforcement learning: A review and perspectives
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …
On the opportunities and risks of foundation models
AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …
[CITATION][C] An introduction to variational autoencoders
Monograph in the Foundations and Trends® in Machine Learning series …
Emergent tool use from multi-agent autocurricula
Through multi-agent competition, the simple objective of hide-and-seek, and standard
reinforcement learning algorithms at scale, we find that agents create a self-supervised …
An introduction to deep reinforcement learning
Deep reinforcement learning is the combination of reinforcement learning (RL) and deep
learning. This field of research has been able to solve a wide range of complex …
Planning to explore via self-supervised world models
Reinforcement learning allows solving complex tasks, however, the learning tends to be task-
specific and the sample efficiency remains a challenge. We present Plan2Explore, a self …
Reinforcement learning for intelligent healthcare applications: A survey
Discovering new treatments and personalizing existing ones is one of the major goals of
modern clinical research. In the last decade, Artificial Intelligence (AI) has enabled the …
Behavior from the void: Unsupervised active pre-training
We introduce a new unsupervised pre-training method for reinforcement learning called
APT, which stands for Active Pre-Training. APT learns behaviors and representations by …
Large-scale study of curiosity-driven learning
Reinforcement learning algorithms rely on carefully engineering environment rewards that
are extrinsic to the agent. However, annotating each environment with hand-designed …