An information-theoretic perspective on intrinsic motivation in reinforcement learning: A survey
The reinforcement learning (RL) research area is very active, with an important number of
new contributions, especially considering the emergent field of deep RL (DRL). However, a …
new contributions, especially considering the emergent field of deep RL (DRL). However, a …
Byol-explore: Exploration by bootstrapped prediction
We present BYOL-Explore, a conceptually simple yet general approach for curiosity-driven
exploration in visually complex environments. BYOL-Explore learns the world …
exploration in visually complex environments. BYOL-Explore learns the world …
Convex reinforcement learning in finite trials
Convex Reinforcement Learning (RL) is a recently introduced framework that generalizes
the standard RL objective to any convex (or concave) function of the state distribution …
the standard RL objective to any convex (or concave) function of the state distribution …
Maximum state entropy exploration using predecessor and successor representations
Animals have a developed ability to explore that aids them in important tasks such as
locating food, exploring for shelter, and finding misplaced items. These exploration skills …
locating food, exploring for shelter, and finding misplaced items. These exploration skills …
Accelerating reinforcement learning with value-conditional state entropy exploration
A promising technique for exploration is to maximize the entropy of visited state distribution,
ie, state entropy, by encouraging uniform coverage of visited state space. While it has been …
ie, state entropy, by encouraging uniform coverage of visited state space. While it has been …
[PDF][PDF] Structure in reinforcement learning: A survey and open problems
Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …
Networks (DNNs) for function approximation, has demonstrated considerable success in …
Cem: Constrained entropy maximization for task-agnostic safe exploration
In the absence of assigned tasks, a learning agent typically seeks to explore its environment
efficiently. However, the pursuit of exploration will bring more safety risks. An under-explored …
efficiently. However, the pursuit of exploration will bring more safety risks. An under-explored …
The importance of non-markovianity in maximum state entropy exploration
In the maximum state entropy exploration framework, an agent interacts with a reward-free
environment to learn a policy that maximizes the entropy of the expected state visitations it is …
environment to learn a policy that maximizes the entropy of the expected state visitations it is …
Challenging common assumptions in convex reinforcement learning
Abstract The classic Reinforcement Learning (RL) formulation concerns the maximization of
a scalar reward function. More recently, convex RL has been introduced to extend the RL …
a scalar reward function. More recently, convex RL has been introduced to extend the RL …
Pretraining in deep reinforcement learning: A survey
The past few years have seen rapid progress in combining reinforcement learning (RL) with
deep learning. Various breakthroughs ranging from games to robotics have spurred the …
deep learning. Various breakthroughs ranging from games to robotics have spurred the …