- Academic Search

L Chen, K Lu, A Rajeswaran, K Lee… - Advances in neural …, 2021 - proceedings.neurips.cc

We introduce a framework that abstracts Reinforcement Learning (RL) as a sequence
modeling problem. This allows us to draw upon the simplicity and scalability of the …

Save Cite Cited by 1770 Related articles All 11 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] openreview.net

What matters in learning from offline human demonstrations for robot manipulation

A Mandlekar, D Xu, J Wong, S Nasiriany… - arxiv preprint arxiv …, 2021 - arxiv.org

Imitating human demonstrations is a promising approach to endow robots with various
manipulation capabilities. While recent advances have been made in imitation learning and …

Save Cite Cited by 414 Related articles All 4 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Behavior Transformers: Cloning modes with one stone

NM Shafiullah, Z Cui… - Advances in neural …, 2022 - proceedings.neurips.cc

While behavior learning has made impressive progress in recent times, it lags behind
computer vision and natural language processing due to its inability to leverage large …

Save Cite Cited by 188 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Playfusion: Skill acquisition via diffusion from language-annotated play

L Chen, S Bahl, D Pathak - Conference on Robot Learning, 2023 - proceedings.mlr.press

Learning from unstructured and uncurated data has become the dominant paradigm for
generative approaches in language or vision. Such unstructured and unguided behavior …

Save Cite Cited by 40 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Imitating human behaviour with diffusion models

T Pearce, T Rashid, A Kanervisto, D Bignell… - arxiv preprint arxiv …, 2023 - arxiv.org

Diffusion models have emerged as powerful generative models in the text-to-image domain.
This paper studies their application as observation-to-action models for imitating human …

Save Cite Cited by 174 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Goal-conditioned reinforcement learning with imagined subgoals

E Chane-Sane, C Schmid… - … conference on machine …, 2021 - proceedings.mlr.press

Goal-conditioned reinforcement learning endows an agent with a large variety of skills, but it
often struggles to solve tasks that require more temporally extended reasoning. In this work …

Save Cite Cited by 157 Related articles All 7 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Offline-to-online reinforcement learning via balanced replay and pessimistic q-ensemble

S Lee, Y Seo, K Lee, P Abbeel… - Conference on Robot …, 2022 - proceedings.mlr.press

Recent advance in deep offline reinforcement learning (RL) has made it possible to train
strong robotic agents from offline datasets. However, depending on the quality of the trained …

Save Cite Cited by 202 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Goal-conditioned imitation learning using score-based diffusion policies

M Reuss, M Li, X Jia, R Lioutikov - arxiv preprint arxiv:2304.02532, 2023 - arxiv.org

We propose a new policy representation based on score-based diffusion models (SDMs).
We apply our new policy representation in the domain of Goal-Conditioned Imitation …

Save Cite Cited by 116 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Hiql: Offline goal-conditioned rl with latent states as actions

S Park, D Ghosh, B Eysenbach… - Advances in Neural …, 2024 - proceedings.neurips.cc

Unsupervised pre-training has recently become the bedrock for computer vision and natural
language processing. In reinforcement learning (RL), goal-conditioned RL can potentially …

Save Cite Cited by 40 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

State2explanation: Concept-based explanations to benefit agent learning and user understanding

D Das, S Chernova, B Kim - Advances in Neural …, 2023 - proceedings.neurips.cc

As more non-AI experts use complex AI systems for daily tasks, there has been an
increasing effort to develop methods that produce explanations of AI decision making that …

Save Cite Cited by 24 Related articles All 6 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Accelerating reinforcement learning with learned skill priors

Decision transformer: Reinforcement learning via sequence modeling

What matters in learning from offline human demonstrations for robot manipulation

Behavior Transformers: Cloning modes with one stone

Playfusion: Skill acquisition via diffusion from language-annotated play

Imitating human behaviour with diffusion models

Goal-conditioned reinforcement learning with imagined subgoals

Offline-to-online reinforcement learning via balanced replay and pessimistic q-ensemble

Goal-conditioned imitation learning using score-based diffusion policies

Hiql: Offline goal-conditioned rl with latent states as actions

State2explanation: Concept-based explanations to benefit agent learning and user understanding