Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arxiv preprint arxiv:2005.01643, 2020 - arxiv.org
In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

Diffusion policy: Visuomotor policy learning via action diffusion

C Chi, Z Xu, S Feng, E Cousineau… - … Journal of Robotics …, 2023 - journals.sagepub.com
This paper introduces Diffusion Policy, a new way of generating robot behavior by
representing a robot's visuomotor policy as a conditional denoising diffusion process. We …

What matters in learning from offline human demonstrations for robot manipulation

A Mandlekar, D Xu, J Wong, S Nasiriany… - arxiv preprint arxiv …, 2021 - arxiv.org
Imitating human demonstrations is a promising approach to endow robots with various
manipulation capabilities. While recent advances have been made in imitation learning and …

Mimicplay: Long-horizon imitation learning by watching human play

C Wang, L Fan, J Sun, R Zhang, L Fei-Fei, D Xu… - arxiv preprint arxiv …, 2023 - arxiv.org
Imitation learning from human demonstrations is a promising paradigm for teaching robots
manipulation skills in the real world. However, learning complex long-horizon tasks often …

Goal-conditioned imitation learning using score-based diffusion policies

M Reuss, M Li, X Jia, R Lioutikov - arxiv preprint arxiv:2304.02532, 2023 - arxiv.org
We propose a new policy representation based on score-based diffusion models (SDMs).
We apply our new policy representation in the domain of Goal-Conditioned Imitation …

Viola: Imitation learning for vision-based manipulation with object proposal priors

Y Zhu, A Joshi, P Stone, Y Zhu - Conference on Robot …, 2023 - proceedings.mlr.press
We introduce VIOLA, an object-centric imitation learning approach to learning closed-loop
visuomotor policies for robot manipulation. Our approach constructs object-centric …

Baku: An efficient transformer for multi-task policy learning

S Haldar, Z Peng, L Pinto - Advances in Neural Information …, 2025 - proceedings.neurips.cc
Training generalist agents capable of solving diverse tasks is challenging, often requiring
large datasets of expert demonstrations. This is particularly problematic in robotics, where …

Mimicgen: A data generation system for scalable robot learning using human demonstrations

A Mandlekar, S Nasiriany, B Wen, I Akinola… - arxiv preprint arxiv …, 2023 - arxiv.org
Imitation learning from a large set of human demonstrations has proved to be an effective
paradigm for building capable robot agents. However, the demonstrations can be extremely …

Transfer learning in robotics: An upcoming breakthrough? A review of promises and challenges

N Jaquier, MC Welle, A Gams, K Yao… - … Journal of Robotics …, 2023 - journals.sagepub.com
Transfer learning is a conceptually-enticing paradigm in pursuit of truly intelligent embodied
agents. The core concept—reusing prior knowledge to learn in and from novel situations—is …

Xskill: Cross embodiment skill discovery

M Xu, Z Xu, C Chi, M Veloso… - Conference on robot …, 2023 - proceedings.mlr.press
Human demonstration videos are a widely available data source for robot learning and an
intuitive user interface for expressing desired behavior. However, directly extracting …