Decision transformer: Reinforcement learning via sequence modeling

L Chen, K Lu, A Rajeswaran, K Lee… - Advances in neural …, 2021 - proceedings.neurips.cc
We introduce a framework that abstracts Reinforcement Learning (RL) as a sequence
modeling problem. This allows us to draw upon the simplicity and scalability of the …

What matters in learning from offline human demonstrations for robot manipulation

A Mandlekar, D Xu, J Wong, S Nasiriany… - arxiv preprint arxiv …, 2021 - arxiv.org
Imitating human demonstrations is a promising approach to endow robots with various
manipulation capabilities. While recent advances have been made in imitation learning and …

Behavior Transformers: Cloning modes with one stone

NM Shafiullah, Z Cui… - Advances in neural …, 2022 - proceedings.neurips.cc
While behavior learning has made impressive progress in recent times, it lags behind
computer vision and natural language processing due to its inability to leverage large …

Playfusion: Skill acquisition via diffusion from language-annotated play

L Chen, S Bahl, D Pathak - Conference on Robot Learning, 2023 - proceedings.mlr.press
Learning from unstructured and uncurated data has become the dominant paradigm for
generative approaches in language or vision. Such unstructured and unguided behavior …

Imitating human behaviour with diffusion models

T Pearce, T Rashid, A Kanervisto, D Bignell… - arxiv preprint arxiv …, 2023 - arxiv.org
Diffusion models have emerged as powerful generative models in the text-to-image domain.
This paper studies their application as observation-to-action models for imitating human …

Goal-conditioned reinforcement learning with imagined subgoals

E Chane-Sane, C Schmid… - … conference on machine …, 2021 - proceedings.mlr.press
Goal-conditioned reinforcement learning endows an agent with a large variety of skills, but it
often struggles to solve tasks that require more temporally extended reasoning. In this work …

Offline-to-online reinforcement learning via balanced replay and pessimistic q-ensemble

S Lee, Y Seo, K Lee, P Abbeel… - Conference on Robot …, 2022 - proceedings.mlr.press
Recent advance in deep offline reinforcement learning (RL) has made it possible to train
strong robotic agents from offline datasets. However, depending on the quality of the trained …

Goal-conditioned imitation learning using score-based diffusion policies

M Reuss, M Li, X Jia, R Lioutikov - arxiv preprint arxiv:2304.02532, 2023 - arxiv.org
We propose a new policy representation based on score-based diffusion models (SDMs).
We apply our new policy representation in the domain of Goal-Conditioned Imitation …

Hiql: Offline goal-conditioned rl with latent states as actions

S Park, D Ghosh, B Eysenbach… - Advances in Neural …, 2024 - proceedings.neurips.cc
Unsupervised pre-training has recently become the bedrock for computer vision and natural
language processing. In reinforcement learning (RL), goal-conditioned RL can potentially …

State2explanation: Concept-based explanations to benefit agent learning and user understanding

D Das, S Chernova, B Kim - Advances in Neural …, 2023 - proceedings.neurips.cc
As more non-AI experts use complex AI systems for daily tasks, there has been an
increasing effort to develop methods that produce explanations of AI decision making that …