Metra: Scalable unsupervised rl with metric-aware abstraction

S Park, O Rybkin, S Levine - arxiv preprint arxiv:2310.08887, 2023 - arxiv.org
Unsupervised pre-training strategies have proven to be highly effective in natural language
processing and computer vision. Likewise, unsupervised reinforcement learning (RL) holds …

[HTML][HTML] On efficient computation in active inference

A Paul, N Sajid, L Da Costa, A Razi - Expert Systems with Applications, 2024 - Elsevier
Biological agents demonstrate a remarkable proficiency in calibrating appropriate scales of
planning and evaluation when interacting with their environments. It follows logically that …

A mixture of surprises for unsupervised reinforcement learning

A Zhao, M Lin, Y Li, YJ Liu… - Advances in Neural …, 2022 - proceedings.neurips.cc
Unsupervised reinforcement learning aims at learning a generalist policy in a reward-free
manner for fast adaptation to downstream tasks. Most of the existing methods propose to …

[ΒΙΒΛΙΟ][B] Natural General Intelligence: How understanding the brain can help us build AI

C Summerfield - 2022 - books.google.com
Since the time of Turing, computer scientists have dreamed of building artificial general
intelligence (AGI)-a system that can think, learn and act as humans do. Over recent years …

Predictable mdp abstraction for unsupervised model-based rl

S Park, S Levine - International Conference on Machine …, 2023 - proceedings.mlr.press
A key component of model-based reinforcement learning (RL) is a dynamics model that
predicts the outcomes of actions. Errors in this predictive model can degrade the …

Accelerating Goal-Conditioned RL Algorithms and Research

M Bortkiewicz, W Pałucki, V Myers, T Dziarmaga… - arxiv preprint arxiv …, 2024 - arxiv.org
Self-supervision has the potential to transform reinforcement learning (RL), paralleling the
breakthroughs it has enabled in other areas of machine learning. While self-supervised …

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

R Wei, N Lambert, A McDonald, A Garcia… - arxiv preprint arxiv …, 2023 - arxiv.org
Model-based Reinforcement Learning (MBRL) aims to make agents more sample-efficient,
adaptive, and explainable by learning an explicit model of the environment. While the …

When in Doubt, Think Slow: Iterative Reasoning with Latent Imagination

M Benfeghoul, U Zahid, Q Guo, Z Fountas - arxiv preprint arxiv …, 2024 - arxiv.org
In an unfamiliar setting, a model-based reinforcement learning agent can be limited by the
accuracy of its world model. In this work, we present a novel, training-free approach to …

Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning

A Hugessen, RC Castanyer, F Mohamed… - arxiv preprint arxiv …, 2024 - arxiv.org
Both entropy-minimizing and entropy-maximizing (curiosity) objectives for unsupervised
reinforcement learning (RL) have been shown to be effective in different environments …

Intrinsic exploration for reinforcement learning beyond rewards

R Creus-Castanyer - 2024 - papyrus.bib.umontreal.ca
In reinforcement learning, a reward function is used to guide the agent's behavior towards
task-specific objectives. However, such extrinsic rewards often fall short in complex …