A survey of imitation learning: Algorithms, recent developments, and challenges

M Zare, PM Kebria, A Khosravi… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In recent years, the development of robotics and artificial intelligence (AI) systems has been
nothing short of remarkable. As these systems continue to evolve, they are being utilized in …

Rt-h: Action hierarchies using language

S Belkhale, T Ding, T ** for offline safe reinforcement learning
Y Yao, Z Cen, W Ding, H Lin, S Liu… - Advances in …, 2025 - proceedings.neurips.cc
Offline safe reinforcement learning (RL) aims to train a policy that satisfies con-straints using
a pre-collected dataset. Most current methods struggle with the mismatch between imperfect …

Efficient data collection for robotic manipulation via compositional generalization

J Gao, A **e, T **ao, C Finn, D Sadigh - arxiv preprint arxiv:2403.05110, 2024 - arxiv.org
Data collection has become an increasingly important problem in robotic manipulation, yet
there still lacks much understanding of how to effectively collect data to facilitate broad …

Inverse-rlignment: Inverse reinforcement learning from demonstrations for llm alignment

H Sun, M van der Schaar - arxiv preprint arxiv:2405.15624, 2024 - arxiv.org
Aligning Large Language Models (LLMs) is crucial for enhancing their safety and utility.
However, existing methods, primarily based on preference datasets, face challenges such …

Re-mix: Optimizing data mixtures for large scale imitation learning

J Hejna, C Bhateja, Y Jian, K Pertsch… - arxiv preprint arxiv …, 2024 - arxiv.org
Increasingly large imitation learning datasets are being collected with the goal of training
foundation models for robotics. However, despite the fact that data selection has been of …

Incremental learning of retrievable skills for efficient continual task adaptation

D Lee, M Yoo, WK Kim, W Choi… - Advances in Neural …, 2025 - proceedings.neurips.cc
Abstract Continual Imitation Learning (CiL) involves extracting and accumulating task
knowledge from demonstrations across multiple stages and tasks to achieve a multi-task …