Representation learning with multi-step inverse kinematics: An efficient and optimal approach to rich-observation RL

Z Mhammedi, DJ Foster… - … Conference on Machine …, 2023 - proceedings.mlr.press
We study the design of sample-efficient algorithms for reinforcement learning in the
presence of rich, high-dimensional observations, formalized via the Block MDP problem …

Inverse dynamics pretraining learns good representations for multitask imitation

D Brandfonbrener, O Nachum… - Advances in Neural …, 2023 - proceedings.neurips.cc
In recent years, domains such as natural language processing and image recognition have
popularized the paradigm of using large datasets to pretrain representations that can be …

Guide your agent with adaptive multimodal rewards

C Kim, Y Seo, H Liu, L Lee, J Shin… - Advances in Neural …, 2023 - proceedings.neurips.cc
Developing an agent capable of adapting to unseen environments remains a difficult
challenge in imitation learning. This work presents Adaptive Return-conditioned Policy …

Ignorance is bliss: Robust control via information gating

M Tomar, R Islam, M Taylor… - Advances in Neural …, 2023 - proceedings.neurips.cc
Informational parsimony provides a useful inductive bias for learning representations that
achieve better generalization by being robust to noise and spurious correlations. We …

Learning latent dynamic robust representations for world models

R Sun, H Zang, X Li, R Islam - arXiv preprint arXiv:2405.06263, 2024 - arxiv.org
Visual Model-Based Reinforcement Learning (MBRL) promises to encapsulate an agent's
knowledge about the underlying dynamics of the environment, enabling learning a world …

Investigating pre-training objectives for generalization in vision-based reinforcement learning

D Kim, H Lee, K Lee, D Hwang, J Choo - arXiv preprint arXiv:2406.06037, 2024 - arxiv.org
Recently, various pre-training methods have been introduced in vision-based
Reinforcement Learning (RL). However, their generalization ability remains unclear due to …

Masked and Inverse Dynamics Modeling for Data-Efficient Reinforcement Learning

YJ Lee, J Kim, YJ Park, M Kwak… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In pixel-based deep reinforcement learning (DRL), learning representations of states that
change because of an agent's action or interaction with the environment poses a critical …

Rich-observation reinforcement learning with continuous latent dynamics

Y Song, L Wu, DJ Foster, A Krishnamurthy - arXiv preprint arXiv …, 2024 - arxiv.org
Sample-efficiency and reliability remain major bottlenecks toward wide adoption of
reinforcement learning algorithms in continuous settings with high-dimensional perceptual …

Video Occupancy Models

M Tomar, P Hansen-Estruch, P Bachman… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce a new family of video prediction models designed to support downstream
control tasks. We call these models Video Occupancy models (VOCs). VOCs operate in a …

PcLast: Discovering plannable continuous latent states

A Koul, S Sujit, S Chen, B Evans, L Wu, B Xu… - arXiv preprint arXiv …, 2023 - arxiv.org
Goal-conditioned planning benefits from learned low-dimensional representations of rich
observations. While compact latent representations typically learned from variational …