- Academic Search

Representation learning with multi-step inverse kinematics: An efficient and optimal approach to rich-observation RL

Z Mhammedi, DJ Foster… - … Conference on Machine …, 2023 - proceedings.mlr.press

We study the design of sample-efficient algorithms for reinforcement learning in the
presence of rich, high-dimensional observations, formalized via the Block MDP problem …

Cited by: 21

Inverse dynamics pretraining learns good representations for multitask imitation

D Brandfonbrener, O Nachum… - Advances in Neural …, 2024 - proceedings.neurips.cc

In recent years, domains such as natural language processing and image recognition have
popularized the paradigm of using large datasets to pretrain representations that can be …

Cited by: 16

Ignorance is bliss: Robust control via information gating

M Tomar, R Islam, M Taylor… - Advances in Neural …, 2023 - proceedings.neurips.cc

Informational parsimony provides a useful inductive bias for learning representations that
achieve better generalization by being robust to noise and spurious correlations. We …

Cited by: 9

Guide your agent with adaptive multimodal rewards

C Kim, Y Seo, H Liu, L Lee, J Shin… - Advances in Neural …, 2024 - proceedings.neurips.cc

Developing an agent capable of adapting to unseen environments remains a difficult
challenge in imitation learning. This work presents Adaptive Return-conditioned Policy …

Cited by: 9

Masked and Inverse Dynamics Modeling for Data-Efficient Reinforcement Learning

YJ Lee, J Kim, YJ Park, M Kwak… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

In pixel-based deep reinforcement learning (DRL), learning representations of states that
change because of an agent's action or interaction with the environment poses a critical …

Cited by: 1

Video Occupancy Models

M Tomar, P Hansen-Estruch, P Bachman… - arXiv preprint arXiv …, 2024 - arxiv.org

We introduce a new family of video prediction models designed to support downstream
control tasks. We call these models Video Occupancy models (VOCs). VOCs operate in a …

Cited by: 1

PcLast: Discovering Plannable Continuous Latent States

A Koul, S Sujit, S Chen, B Evans, L Wu, B Xu… - arXiv preprint arXiv …, 2023 - arxiv.org

Goal-conditioned planning benefits from learned low-dimensional representations of rich,
high-dimensional observations. While compact latent representations, typically learned from …

Cited by: 2

Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement Learning

D Kim, H Lee, K Lee, D Hwang, J Choo - arXiv preprint arXiv:2406.06037, 2024 - arxiv.org

Recently, various pre-training methods have been introduced in vision-based
Reinforcement Learning (RL). However, their generalization ability remains unclear due to …

Cited by: 3

Rich-Observation Reinforcement Learning with Continuous Latent Dynamics

Y Song, L Wu, DJ Foster, A Krishnamurthy - arXiv preprint arXiv …, 2024 - arxiv.org

Sample-efficiency and reliability remain major bottlenecks toward wide adoption of
reinforcement learning algorithms in continuous settings with high-dimensional perceptual …

Cited by: 1

Towards Principled Representation Learning from Videos for Reinforcement Learning

D Misra, A Saran, T Xie, A Lamb, J Langford - arXiv preprint arXiv …, 2024 - arxiv.org

We study pre-training representations for decision-making using video data, which is
abundantly available for tasks such as game agents and software testing. Even though …

Cited by: 1

Agent-controller representations: Principled offline RL with rich exogenous information
