World models and predictive coding for cognitive and developmental robotics: frontiers and challenges

T Taniguchi, S Murata, M Suzuki, D Ognibene… - Advanced …, 2023 - Taylor & Francis
Creating autonomous robots that can actively explore the environment, acquire knowledge
and learn skills continuously is the ultimate achievement envisioned in cognitive and …

Optimal goal-reaching reinforcement learning via quasimetric learning

T Wang, A Torralba, P Isola… - … Conference on Machine …, 2023 - proceedings.mlr.press
In goal-reaching reinforcement learning (RL), the optimal value function has a particular
geometry, called quasimetrics structure. This paper introduces Quasimetric Reinforcement …

Structure in deep reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - Journal of Artificial Intelligence Research, 2024 - jair.org
Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

Reinforcement learning: An overview

K Murphy - arxiv preprint arxiv:2412.05265, 2024 - arxiv.org
This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement
learning and sequential decision making, covering value-based RL, policy-gradient …

Bridging state and history representations: Understanding self-predictive rl

T Ni, B Eysenbach, E Seyedsalehi, M Ma… - arxiv preprint arxiv …, 2024 - arxiv.org
Representations are at the core of all deep reinforcement learning (RL) methods for both
Markov decision processes (MDPs) and partially observable Markov decision processes …

Learning world models with identifiable factorization

Y Liu, B Huang, Z Zhu, H Tian… - Advances in Neural …, 2023 - proceedings.neurips.cc
Extracting a stable and compact representation of the environment is crucial for efficient
reinforcement learning in high-dimensional, noisy, and non-stationary environments …

Repo: Resilient model-based reinforcement learning by regularizing posterior predictability

C Zhu, M Simchowitz, S Gadipudi… - Advances in Neural …, 2023 - proceedings.neurips.cc
Visual model-based RL methods typically encode image observations into low-dimensional
representations in a manner that does not eliminate redundant information. This leaves them …

Ignorance is bliss: Robust control via information gating

M Tomar, R Islam, M Taylor… - Advances in Neural …, 2023 - proceedings.neurips.cc
Informational parsimony provides a useful inductive bias for learning representations that
achieve better generalization by being robust to noise and spurious correlations. We …

Building minimal and reusable causal state abstractions for reinforcement learning

Z Wang, C Wang, X **ao, Y Zhu, P Stone - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Two desiderata of reinforcement learning (RL) algorithms are the ability to learn from
relatively little experience and the ability to learn policies that generalize to a range of …

Guaranteed discovery of control-endogenous latent states with multi-step inverse models

A Lamb, R Islam, Y Efroni, A Didolkar, D Misra… - arxiv preprint arxiv …, 2022 - arxiv.org
In many sequential decision-making tasks, the agent is not able to model the full complexity
of the world, which consists of multitudes of relevant and irrelevant information. For example …