Google Tudós

L Chen, S Bahl, D Pathak - Conference on Robot Learning, 2023 - proceedings.mlr.press

Learning from unstructured and uncurated data has become the dominant paradigm for
generative approaches in language or vision. Such unstructured and unguided behavior …

Mentés Hivatkozás Idézetek száma: 41 Kapcsolódó cikkek Mind a(z) 5 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Graph deep learning: State of the art and challenges

S Georgousis, MP Kenning, X **e - IEEE Access, 2021 - ieeexplore.ieee.org

The last half-decade has seen a surge in deep learning research on irregular domains and
efforts to extend convolutional neural networks (CNNs) to work on irregularly structured data …

Mentés Hivatkozás Idézetek száma: 69 Kapcsolódó cikkek Mind a(z) 8 változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Relay policy learning: Solving long-horizon tasks via imitation and reinforcement learning

A Gupta, V Kumar, C Lynch, S Levine… - arxiv preprint arxiv …, 2019 - arxiv.org

We present relay policy learning, a method for imitation and reinforcement learning that can
solve multi-stage, long-horizon robotic tasks. This general and universally-applicable, two …

Mentés Hivatkozás Idézetek száma: 432 Kapcsolódó cikkek Mind a(z) 6 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Parrot: Data-driven behavioral priors for reinforcement learning

A Singh, H Liu, G Zhou, A Yu, N Rhinehart… - arxiv preprint arxiv …, 2020 - arxiv.org

Reinforcement learning provides a general framework for flexible decision making and
control, but requires extensive data collection for each new task that an agent needs to …

Mentés Hivatkozás Idézetek száma: 158 Kapcsolódó cikkek Mind a(z) 3 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Learning neuro-symbolic skills for bilevel planning

T Silver, A Athalye, JB Tenenbaum… - arxiv preprint arxiv …, 2022 - arxiv.org

Decision-making is challenging in robotics environments with continuous object-centric
states, continuous actions, long horizons, and sparse feedback. Hierarchical approaches …

Mentés Hivatkozás Idézetek száma: 82 Kapcsolódó cikkek Mind a(z) 5 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Generalizing goal-conditioned reinforcement learning with variational causal reasoning

W Ding, H Lin, B Li, D Zhao - Advances in Neural …, 2022 - proceedings.neurips.cc

As a pivotal component to attaining generalizable solutions in human intelligence,
reasoning provides great potential for reinforcement learning (RL) agents' generalization …

Mentés Hivatkozás Idézetek száma: 45 Kapcsolódó cikkek Mind a(z) 7 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multi-stage cable routing through hierarchical imitation learning

J Luo, C Xu, X Geng, G Feng, K Fang… - IEEE Transactions …, 2024 - ieeexplore.ieee.org

We study the problem of learning to perform multistage robotic manipulation tasks, with
applications to cable routing, where the robot must route a cable through a series of clips …

Mentés Hivatkozás Idézetek száma: 39 Kapcsolódó cikkek Mind a(z) 4 változat

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Hierarchical diffusion for offline decision making

W Li, X Wang, B **, H Zha - International Conference on …, 2023 - proceedings.mlr.press

Offline reinforcement learning typically introduces a hierarchical structure to solve the long-
horizon problem so as to address its thorny issue of variance accumulation. Problems of …

Mentés Hivatkozás Idézetek száma: 31 Kapcsolódó cikkek Mind a(z) 5 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Recent advances in leveraging human guidance for sequential decision-making tasks

R Zhang, F Torabi, G Warnell, P Stone - Autonomous Agents and Multi …, 2021 - Springer

A longstanding goal of artificial intelligence is to create artificial agents capable of learning
to perform tasks that require sequential decision making. Importantly, while it is the artificial …

Mentés Hivatkozás Idézetek száma: 36 Kapcsolódó cikkek Mind a(z) 6 változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Dichotomy of control: Separating what you can control from what you cannot

M Yang, D Schuurmans, P Abbeel… - arxiv preprint arxiv …, 2022 - arxiv.org

Future-or return-conditioned supervised learning is an emerging paradigm for offline
reinforcement learning (RL), where the future outcome (ie, return) associated with an …

Mentés Hivatkozás Idézetek száma: 47 Kapcsolódó cikkek Mind a(z) 5 változat HTML-változat

Értesítés létrehozása

Hivatkozás

Speciális keresés

Mentve a Saját könyvtárba

Compile: Compositional imitation learning and execution

Playfusion: Skill acquisition via diffusion from language-annotated play

Graph deep learning: State of the art and challenges

Relay policy learning: Solving long-horizon tasks via imitation and reinforcement learning

Parrot: Data-driven behavioral priors for reinforcement learning

Learning neuro-symbolic skills for bilevel planning

Generalizing goal-conditioned reinforcement learning with variational causal reasoning

Multi-stage cable routing through hierarchical imitation learning

Hierarchical diffusion for offline decision making

Recent advances in leveraging human guidance for sequential decision-making tasks

Dichotomy of control: Separating what you can control from what you cannot