Академия Google

D Chen, P Krähenbühl - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com

In this paper, we present a system to train driving policies from experiences collected not just
from the ego-vehicle, but all vehicles that it observes. This system uses the behaviors of …

Сохранить Цитировать Цитируется: 196 Похожие статьи Все версии статьи (8) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] mit.edu

Predictive representations: Building blocks of intelligence

W Carvalho, MS Tomov, W de Cothi, C Barry… - Neural …, 2024 - direct.mit.edu

Adaptive behavior often requires predicting future events. The theory of reinforcement
learning prescribes what kinds of predictive representations are useful and how to compute …

Сохранить Цитировать Цитируется: 8 Похожие статьи Все версии статьи (9)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Trihelper: Zero-shot object navigation with dynamic assistance

L Zhang, Q Zhang, H Wang, E **ao… - 2024 IEEE/RSJ …, 2024 - ieeexplore.ieee.org

Navigating toward specific objects in unknown environments without additional training,
known as Zero-Shot object navigation, poses a significant challenge in the field of robotics …

Сохранить Цитировать Цитируется: 8 Похожие статьи Все версии статьи (4)

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Combining behaviors with the successor features keyboard

WC Carvalho, A Saraiva, A Filos… - Advances in neural …, 2023 - proceedings.neurips.cc

Abstract The Option Keyboard (OK) was recently proposed as a method for transferring
behavioral knowledge across tasks. OK transfers knowledge by adaptively combining …

Сохранить Цитировать Цитируется: 6 Похожие статьи Все версии статьи (9) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Cacto: Continuous actor-critic with trajectory optimization—towards global optimality

G Grandesso, E Alboni, GPR Papini… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org

This letter presents a novel algorithm for the continuous control of dynamical systems that
combines Trajectory Optimization (TO) and Reinforcement Learning (RL) in a single …

Сохранить Цитировать Цитируется: 19 Похожие статьи Все версии статьи (5)

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Self-supervised reinforcement learning that transfers using random features

B Chen, C Zhu, P Agrawal… - Advances in Neural …, 2023 - proceedings.neurips.cc

Abstract Model-free reinforcement learning algorithms have exhibited great potential in
solving single-task sequential decision-making problems with high-dimensional …

Сохранить Цитировать Цитируется: 7 Похожие статьи Все версии статьи (5) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

Spatial-temporal causality modeling for industrial processes with a knowledge-data guided reinforcement learning

X Zhang, C Song, J Zhao, Z Xu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Causality in an industrial process provides insights into how various process variables
interact and affect each other within the system. It reveals the underlying mechanisms of …

Сохранить Цитировать Цитируется: 7 Похожие статьи Все версии статьи (2)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Composing task knowledge with modular successor feature approximators

W Carvalho, A Filos, RL Lewis, S Singh - arxiv preprint arxiv …, 2023 - arxiv.org

Recently, the Successor Features and Generalized Policy Improvement (SF&GPI) framework
has been proposed as a method for learning, composing, and transferring predictive …

Сохранить Цитировать Цитируется: 12 Похожие статьи Все версии статьи (4) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Contrastive value learning: Implicit models for simple offline rl

B Mazoure, B Eysenbach, O Nachum… - … on Robot Learning, 2023 - proceedings.mlr.press

Abstract Model-based reinforcement learning (RL) methods are appealing in the offline
setting because they allow an agent to reason about the consequences of actions without …

Сохранить Цитировать Цитируется: 8 Похожие статьи Все версии статьи (6) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review

H Hassani, R Razavi-Far, M Saif, L Lin - arxiv preprint arxiv:2411.10268, 2024 - arxiv.org

Reinforcement learning (RL) is a sub-domain of machine learning, mainly concerned with
solving sequential decision-making problems by a learning agent that interacts with the …

Сохранить Цитировать Похожие статьи Все версии статьи (2) В виде HTML

Создать оповещение

Цитировать

Расширенный поиск

Сохранено в вашей библиотеке

Psiphi-learning: Reinforcement learning with demonstrations using successor features and...

Learning from all vehicles

Predictive representations: Building blocks of intelligence

Trihelper: Zero-shot object navigation with dynamic assistance

Combining behaviors with the successor features keyboard

Cacto: Continuous actor-critic with trajectory optimization—towards global optimality

Self-supervised reinforcement learning that transfers using random features

Spatial-temporal causality modeling for industrial processes with a knowledge-data guided reinforcement learning

Composing task knowledge with modular successor feature approximators

Contrastive value learning: Implicit models for simple offline rl

Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review