Google Академик

T Wang, A Torralba, P Isola… - … Conference on Machine …, 2023 - proceedings.mlr.press

In goal-reaching reinforcement learning (RL), the optimal value function has a particular
geometry, called quasimetrics structure. This paper introduces Quasimetric Reinforcement …

Сачувај Цитирај 38 пута наведен Сродни чланци Све верзије (8) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Facing off world model backbones: Rnns, transformers, and s4

F Deng, J Park, S Ahn - Advances in Neural Information …, 2023 - proceedings.neurips.cc

World models are a fundamental component in model-based reinforcement learning
(MBRL). To perform temporally extended and consistent simulations of the future in partially …

Сачувај Цитирај 24 пута наведен Сродни чланци Све верзије (6) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

On the importance of exploration for generalization in reinforcement learning

Y Jiang, JZ Kolter, R Raileanu - Advances in Neural …, 2023 - proceedings.neurips.cc

Existing approaches for improving generalization in deep reinforcement learning (RL) have
mostly focused on representation learning, neglecting RL-specific aspects such as …

Сачувај Цитирај 20 пута наведен Сродни чланци Све верзије (5) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] jair.org Full View

Structure in deep reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - Journal of Artificial Intelligence Research, 2024 - jair.org

Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

Сачувај Цитирај 23 пута наведен Сродни чланци Све верзије (10) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Denoised mdps: Learning world models better than the world itself

T Wang, SS Du, A Torralba, P Isola, A Zhang… - arxiv preprint arxiv …, 2022 - arxiv.org

The ability to separate signal from noise, and reason with clean abstractions, is critical to
intelligence. With this ability, humans can efficiently perform real world tasks without …

Сачувај Цитирај 55 пута наведен Сродни чланци Све верзије (4) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Dreamerpro: Reconstruction-free model-based reinforcement learning with prototypical representations

F Deng, I Jang, S Ahn - International Conference on …, 2022 - proceedings.mlr.press

Abstract Reconstruction-based Model-Based Reinforcement Learning (MBRL) agents, such
as Dreamer, often fail to discard task-irrelevant visual distractions that are prevalent in …

Сачувај Цитирај 63 пута наведен Сродни чланци Све верзије (10) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning

D Bertoin, A Zouitine, M Zouitine… - Advances in neural …, 2022 - proceedings.neurips.cc

Deep reinforcement learning policies, despite their outstanding efficiency in simulated visual
control tasks, have shown disappointing ability to generalize across disturbances in the …

Сачувај Цитирај 36 пута наведен Сродни чланци Све верзије (25) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Inverse dynamics pretraining learns good representations for multitask imitation

D Brandfonbrener, O Nachum… - Advances in Neural …, 2023 - proceedings.neurips.cc

In recent years, domains such as natural language processing and image recognition have
popularized the paradigm of using large datasets to pretrain representations that can be …

Сачувај Цитирај 19 пута наведен Сродни чланци Све верзије (9) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Bridging state and history representations: Understanding self-predictive rl

T Ni, B Eysenbach, E Seyedsalehi, M Ma… - arxiv preprint arxiv …, 2024 - arxiv.org

Representations are at the core of all deep reinforcement learning (RL) methods for both
Markov decision processes (MDPs) and partially observable Markov decision processes …

Сачувај Цитирај 23 пута наведен Сродни чланци Све верзије (6) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Causal dynamics learning for task-independent state abstraction

Z Wang, X **ao, Z Xu, Y Zhu, P Stone - arxiv preprint arxiv:2206.13452, 2022 - arxiv.org

Learning dynamics models accurately is an important goal for Model-Based Reinforcement
Learning (MBRL), but most MBRL methods learn a dense dynamics model which is …

Сачувај Цитирај 45 пута наведен Сродни чланци Све верзије (14) HTML верзија

Направи обавештење

Цитирај

Напредна претрага

Сачувано у мојој библиотеци

Learning task informed abstractions

Optimal goal-reaching reinforcement learning via quasimetric learning

Facing off world model backbones: Rnns, transformers, and s4

On the importance of exploration for generalization in reinforcement learning

Structure in deep reinforcement learning: A survey and open problems

Denoised mdps: Learning world models better than the world itself

Dreamerpro: Reconstruction-free model-based reinforcement learning with prototypical representations

Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning

Inverse dynamics pretraining learns good representations for multitask imitation

Bridging state and history representations: Understanding self-predictive rl

Causal dynamics learning for task-independent state abstraction