An information-theoretic perspective on intrinsic motivation in reinforcement learning: A survey

A Aubret, L Matignon, S Hassas - Entropy, 2023 - mdpi.com
The reinforcement learning (RL) research area is very active, with an important number of
new contributions, especially considering the emergent field of deep RL (DRL). However, a …

Video prediction models as rewards for reinforcement learning

A Escontrela, A Adeniji, W Yan, A Jain… - Advances in …, 2023 - proceedings.neurips.cc
Specifying reward signals that allow agents to learn complex behaviors is a long-standing
challenge in reinforcement learning. A promising approach is to extract preferences for …

: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning

R Zheng, X Wang, Y Sun, S Ma… - Advances in …, 2023 - proceedings.neurips.cc
Despite recent progress in reinforcement learning (RL) from raw pixel data, sample
inefficiency continues to present a substantial obstacle. Prior works have attempted to …

Hybrid rl: Using both offline and online data can make rl efficient

Y Song, Y Zhou, A Sekhari, JA Bagnell… - arxiv preprint arxiv …, 2022 - arxiv.org
We consider a hybrid reinforcement learning setting (Hybrid RL), in which an agent has
access to an offline dataset and the ability to collect experience via real-world online …

Inference via interpolation: Contrastive representations provably enable planning and inference

B Eysenbach, V Myers… - Advances in Neural …, 2025 - proceedings.neurips.cc
Given time series data, how can we answer questions like what will happen in the
future?''and how did we get here?''These sorts of probabilistic inference questions are …

A cookbook of self-supervised learning

R Balestriero, M Ibrahim, V Sobal, A Morcos… - arxiv preprint arxiv …, 2023 - arxiv.org
Self-supervised learning, dubbed the dark matter of intelligence, is a promising path to
advance machine learning. Yet, much like cooking, training SSL methods is a delicate art …

Understanding self-predictive learning for reinforcement learning

Y Tang, ZD Guo, PH Richemond… - International …, 2023 - proceedings.mlr.press
We study the learning dynamics of self-predictive learning for reinforcement learning, a
family of algorithms that learn representations by minimizing the prediction error of their own …

Augmented behavioral annotation tools, with application to multimodal datasets and models: A systematic review

E Watson, T Viana, S Zhang - AI, 2023 - mdpi.com
Annotation tools are an essential component in the creation of datasets for machine learning
purposes. Annotation tools have evolved greatly since the turn of the century, and now …

Bridging state and history representations: Understanding self-predictive rl

T Ni, B Eysenbach, E Seyedsalehi, M Ma… - arxiv preprint arxiv …, 2024 - arxiv.org
Representations are at the core of all deep reinforcement learning (RL) methods for both
Markov decision processes (MDPs) and partially observable Markov decision processes …

Mask-based latent reconstruction for reinforcement learning

T Yu, Z Zhang, C Lan, Y Lu… - Advances in Neural …, 2022 - proceedings.neurips.cc
For deep reinforcement learning (RL) from pixels, learning effective state representations is
crucial for achieving high performance. However, in practice, limited experience and high …