Learning latent dynamic robust representations for world models

R Sun, H Zang, X Li, R Islam - arxiv preprint arxiv:2405.06263, 2024 - arxiv.org
Visual Model-Based Reinforcement Learning (MBRL) promises to encapsulate agent's
knowledge about the underlying dynamics of the environment, enabling learning a world …

Madi: Learning to mask distractions for generalization in visual deep reinforcement learning

B Grooten, T Tomilin, G Vasan, ME Taylor… - arxiv preprint arxiv …, 2023 - arxiv.org
The visual world provides an abundance of information, but many input pixels received by
agents often contain distracting stimuli. Autonomous agents need the ability to distinguish …

Policy-shaped prediction: avoiding distractions in model-based reinforcement learning

M Hutson, I Kauvar, N Haber - Advances in Neural …, 2025 - proceedings.neurips.cc
Abstract Model-based reinforcement learning (MBRL) is a promising route to sample-
efficient policy optimization. However, a known vulnerability of reconstruction-based MBRL …

An unsupervised approach to achieve supervised-level explainability in healthcare records

J Edin, M Maistro, L Maaløe, L Borgholt… - arxiv preprint arxiv …, 2024 - arxiv.org
Electronic healthcare records are vital for patient safety as they document conditions, plans,
and procedures in both free text and medical codes. Language models have significantly …

SeMOPO: learning high-quality model and policy from low-quality offline visual datasets

S Wan, Z Chen, L Gan, S Feng, DC Zhan - arxiv preprint arxiv:2406.09486, 2024 - arxiv.org
Model-based offline reinforcement Learning (RL) is a promising approach that leverages
existing data effectively in many real-world applications, especially those involving high …

Curb Your Attention: Causal Attention Gating for Robust Trajectory Prediction in Autonomous Driving

E Ahmadi, R Mercurius, S Alizadeh, K Rezaee… - arxiv preprint arxiv …, 2024 - arxiv.org
Trajectory prediction models in autonomous driving are vulnerable to perturbations from non-
causal agents whose actions should not affect the ego-agent's behavior. Such perturbations …

Unified Auto-Encoding with Masked Diffusion

P Hansen-Estruch, S Vishwanath, A Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
At the core of both successful generative and self-supervised representation learning
models there is a reconstruction objective that incorporates some form of image corruption …

Learning Versatile Skills with Curriculum Masking

Y Tang, Z **e, Z Lin, D Ye, S Li - The Thirty-eighth Annual …, 2024 - openreview.net
Masked prediction has emerged as a promising pretraining paradigm in offline
reinforcement learning (RL) due to its versatile masking schemes, enabling flexible …

Denoised Predictive Imagination: An Information-theoretic approach for learning World Models

V Dave, E Rueckert - Seventeenth European Workshop on Reinforcement … - openreview.net
Humans excel at isolating relevant information from noisy data to predict the behavior of
dynamic systems, effectively disregarding non-informative, temporally-correlated noise. In …

CLEAR: An Information-Theoretic Framework for Distraction-Free Representation Learning in Visual Offline RL

TW Guntara, DE Matsunaga, HJ Hwang, KE Kim - openreview.net
Visual offline RL aims to learn an optimal policy for visual domains, solely from the pre-
collected dataset comprised of actions taken on visual observations. Prior works on visual …