Inference via interpolation: Contrastive representations provably enable planning and inference

B Eysenbach, V Myers… - Advances in Neural …, 2025 - proceedings.neurips.cc
Given time series data, how can we answer questions like what will happen in the
future?''and how did we get here?''These sorts of probabilistic inference questions are …

Reinforcement learning: An overview

K Murphy - arxiv preprint arxiv:2412.05265, 2024 - arxiv.org
This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement
learning and sequential decision making, covering value-based RL, policy-gradient …

Bridging state and history representations: Understanding self-predictive rl

T Ni, B Eysenbach, E Seyedsalehi, M Ma… - arxiv preprint arxiv …, 2024 - arxiv.org
Representations are at the core of all deep reinforcement learning (RL) methods for both
Markov decision processes (MDPs) and partially observable Markov decision processes …

[HTML][HTML] Investigating the properties of neural network representations in reinforcement learning

H Wang, E Miahi, M White, MC Machado, Z Abbas… - Artificial Intelligence, 2024 - Elsevier
In this paper we investigate the properties of representations learned by deep reinforcement
learning systems. Much of the early work on representations for reinforcement learning …

Predictive auxiliary objectives in deep rl mimic learning in the brain

C Fang, KL Stachenfeld - arxiv preprint arxiv:2310.06089, 2023 - arxiv.org
The ability to predict upcoming events has been hypothesized to comprise a key aspect of
natural and machine cognition. This is supported by trends in deep reinforcement learning …

A unified view on solving objective mismatch in model-based reinforcement learning

R Wei, N Lambert, A McDonald, A Garcia… - arxiv preprint arxiv …, 2023 - arxiv.org
Model-based Reinforcement Learning (MBRL) aims to make agents more sample-efficient,
adaptive, and explainable by learning an explicit model of the environment. While the …

Representations and exploration for deep reinforcement learning using singular value decomposition

Y Chandak, S Thakoor, ZD Guo… - International …, 2023 - proceedings.mlr.press
Abstract Representation learning and exploration are among the key challenges for any
deep reinforcement learning agent. In this work, we provide a singular value decomposition …

Cross-domain policy adaptation by capturing representation mismatch

J Lyu, C Bai, J Yang, Z Lu, X Li - arxiv preprint arxiv:2405.15369, 2024 - arxiv.org
It is vital to learn effective policies that can be transferred to different domains with dynamics
discrepancies in reinforcement learning (RL). In this paper, we consider dynamics …

Self-predictive universal AI

E Catt, J Grau-Moya, M Hutter… - Advances in …, 2023 - proceedings.neurips.cc
Reinforcement Learning (RL) algorithms typically utilize learning and/or planning
techniques to derive effective policies. The integration of both approaches has proven to be …

Curiosity in hindsight: Intrinsic exploration in stochastic environments

D Jarrett, C Tallec, F Altché, T Mesnard… - arxiv preprint arxiv …, 2022 - arxiv.org
Consider the problem of exploration in sparse-reward or reward-free environments, such as
in Montezuma's Revenge. In the curiosity-driven paradigm, the agent is rewarded for how …