Deepmdp: Learning continuous latent space models for representation learning

C Gelada, S Kumar, J Buckman… - International …, 2019 - proceedings.mlr.press
Many reinforcement learning (RL) tasks provide the agent with high-dimensional
observations that can be simplified into low-dimensional continuous states. To formalize this …

On the effect of auxiliary tasks on representation dynamics

C Lyle, M Rowland, G Ostrovski… - International …, 2021 - proceedings.mlr.press
While auxiliary tasks play a key role in sha** the representations learnt by reinforcement
learning agents, much is still unknown about the mechanisms through which this is …

[BUKU][B] Distributional reinforcement learning

MG Bellemare, W Dabney, M Rowland - 2023 - books.google.com
The first comprehensive guide to distributional reinforcement learning, providing a new
mathematical formalism for thinking about decisions from a probabilistic perspective …

A geometric perspective on optimal representations for reinforcement learning

M Bellemare, W Dabney, R Dadashi… - Advances in neural …, 2019 - proceedings.neurips.cc
We propose a new perspective on representation learning in reinforcement learning based
on geometric properties of the space of value functions. From there, we provide formal …

Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey

P Li, J Hao, H Tang, X Fu, Y Zheng, K Tang - arxiv preprint arxiv …, 2024 - arxiv.org
Evolutionary Reinforcement Learning (ERL), which integrates Evolutionary Algorithms (EAs)
and Reinforcement Learning (RL) for optimization, has demonstrated remarkable …

On the generalization of representations in reinforcement learning

CL Lan, S Tu, A Oberman, R Agarwal… - arxiv preprint arxiv …, 2022 - arxiv.org
In reinforcement learning, state representations are used to tractably deal with large problem
spaces. State representations serve both to approximate the value function with few …

Online bootstrap inference for policy evaluation in reinforcement learning

P Ramprasad, Y Li, Z Yang, Z Wang… - Journal of the …, 2023 - Taylor & Francis
The recent emergence of reinforcement learning (RL) has created a demand for robust
statistical inference methods for the parameter estimates computed using these algorithms …

Representations for stable off-policy reinforcement learning

D Ghosh, MG Bellemare - International Conference on …, 2020 - proceedings.mlr.press
Reinforcement learning with function approximation can be unstable and even divergent,
especially when combined with off-policy learning and Bellman updates. In deep …

Reining generalization in offline reinforcement learning via representation distinction

Y Ma, H Tang, D Li, Z Meng - Advances in Neural …, 2024 - proceedings.neurips.cc
Abstract Offline Reinforcement Learning (RL) aims to address the challenge of distribution
shift between the dataset and the learned policy, where the value of out-of-distribution …

Learning bellman complete representations for offline policy evaluation

J Chang, K Wang, N Kallus… - … Conference on Machine …, 2022 - proceedings.mlr.press
We study representation learning for Offline Reinforcement Learning (RL), focusing on the
important task of Offline Policy Evaluation (OPE). Recent work shows that, in contrast to …