Optimal goal-reaching reinforcement learning via quasimetric learning

T Wang, A Torralba, P Isola… - … Conference on Machine …, 2023 - proceedings.mlr.press
In goal-reaching reinforcement learning (RL), the optimal value function has a particular
geometry, called quasimetrics structure. This paper introduces Quasimetric Reinforcement …

Facing off world model backbones: Rnns, transformers, and s4

F Deng, J Park, S Ahn - Advances in Neural Information …, 2023 - proceedings.neurips.cc
World models are a fundamental component in model-based reinforcement learning
(MBRL). To perform temporally extended and consistent simulations of the future in partially …

On the importance of exploration for generalization in reinforcement learning

Y Jiang, JZ Kolter, R Raileanu - Advances in Neural …, 2023 - proceedings.neurips.cc
Existing approaches for improving generalization in deep reinforcement learning (RL) have
mostly focused on representation learning, neglecting RL-specific aspects such as …

Structure in deep reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - Journal of Artificial Intelligence Research, 2024 - jair.org
Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

Denoised mdps: Learning world models better than the world itself

T Wang, SS Du, A Torralba, P Isola, A Zhang… - arxiv preprint arxiv …, 2022 - arxiv.org
The ability to separate signal from noise, and reason with clean abstractions, is critical to
intelligence. With this ability, humans can efficiently perform real world tasks without …

Dreamerpro: Reconstruction-free model-based reinforcement learning with prototypical representations

F Deng, I Jang, S Ahn - International Conference on …, 2022 - proceedings.mlr.press
Abstract Reconstruction-based Model-Based Reinforcement Learning (MBRL) agents, such
as Dreamer, often fail to discard task-irrelevant visual distractions that are prevalent in …

Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning

D Bertoin, A Zouitine, M Zouitine… - Advances in neural …, 2022 - proceedings.neurips.cc
Deep reinforcement learning policies, despite their outstanding efficiency in simulated visual
control tasks, have shown disappointing ability to generalize across disturbances in the …

Inverse dynamics pretraining learns good representations for multitask imitation

D Brandfonbrener, O Nachum… - Advances in Neural …, 2023 - proceedings.neurips.cc
In recent years, domains such as natural language processing and image recognition have
popularized the paradigm of using large datasets to pretrain representations that can be …

Bridging state and history representations: Understanding self-predictive rl

T Ni, B Eysenbach, E Seyedsalehi, M Ma… - arxiv preprint arxiv …, 2024 - arxiv.org
Representations are at the core of all deep reinforcement learning (RL) methods for both
Markov decision processes (MDPs) and partially observable Markov decision processes …

Causal dynamics learning for task-independent state abstraction

Z Wang, X **ao, Z Xu, Y Zhu, P Stone - arxiv preprint arxiv:2206.13452, 2022 - arxiv.org
Learning dynamics models accurately is an important goal for Model-Based Reinforcement
Learning (MBRL), but most MBRL methods learn a dense dynamics model which is …