An information-theoretic perspective on intrinsic motivation in reinforcement learning: A survey

A Aubret, L Matignon, S Hassas - Entropy, 2023 - mdpi.com
The reinforcement learning (RL) research area is very active, with an important number of
new contributions, especially considering the emergent field of deep RL (DRL). However, a …

Hybrid rl: Using both offline and online data can make rl efficient

Y Song, Y Zhou, A Sekhari, JA Bagnell… - arxiv preprint arxiv …, 2022 - arxiv.org
We consider a hybrid reinforcement learning setting (Hybrid RL), in which an agent has
access to an offline dataset and the ability to collect experience via real-world online …

Video prediction models as rewards for reinforcement learning

A Escontrela, A Adeniji, W Yan, A Jain… - Advances in …, 2024 - proceedings.neurips.cc
Specifying reward signals that allow agents to learn complex behaviors is a long-standing
challenge in reinforcement learning. A promising approach is to extract preferences for …

Firerisk: A remote sensing dataset for fire risk assessment with benchmarks using supervised and self-supervised learning

S Shen, S Seneviratne, X Wanyan… - … Conference on Digital …, 2023 - ieeexplore.ieee.org
In recent decades, wildfires have caused tremendous property losses, fatalities, and
extensive damage to forest ecosystems. Inspired by the abundance of publicly available …

Understanding self-predictive learning for reinforcement learning

Y Tang, ZD Guo, PH Richemond… - International …, 2023 - proceedings.mlr.press
We study the learning dynamics of self-predictive learning for reinforcement learning, a
family of algorithms that learn representations by minimizing the prediction error of their own …

: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning

R Zheng, X Wang, Y Sun, S Ma… - Advances in …, 2024 - proceedings.neurips.cc
Despite recent progress in reinforcement learning (RL) from raw pixel data, sample
inefficiency continues to present a substantial obstacle. Prior works have attempted to …

Mask-based latent reconstruction for reinforcement learning

T Yu, Z Zhang, C Lan, Y Lu… - Advances in Neural …, 2022 - proceedings.neurips.cc
For deep reinforcement learning (RL) from pixels, learning effective state representations is
crucial for achieving high performance. However, in practice, limited experience and high …

Augmented behavioral annotation tools, with application to multimodal datasets and models: a systematic review

E Watson, T Viana, S Zhang - AI, 2023 - mdpi.com
Annotation tools are an essential component in the creation of datasets for machine learning
purposes. Annotation tools have evolved greatly since the turn of the century, and now …

Discovering hierarchical achievements in reinforcement learning via contrastive learning

S Moon, J Yeom, B Park… - Advances in Neural …, 2024 - proceedings.neurips.cc
Discovering achievements with a hierarchical structure in procedurally generated
environments presents a significant challenge. This requires an agent to possess a broad …

Semantically aligned task decomposition in multi-agent reinforcement learning

W Li, D Qiao, B Wang, X Wang, B **, H Zha - arxiv preprint arxiv …, 2023 - arxiv.org
The difficulty of appropriately assigning credit is particularly heightened in cooperative
MARL with sparse reward, due to the concurrent time and structural scales involved …