A survey of imitation learning: Algorithms, recent developments, and challenges
M Zare, PM Kebria, A Khosravi… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In recent years, the development of robotics and artificial intelligence (AI) systems has been
nothing short of remarkable. As these systems continue to evolve, they are being utilized in …
nothing short of remarkable. As these systems continue to evolve, they are being utilized in …
Diffusion policy: Visuomotor policy learning via action diffusion
This paper introduces Diffusion Policy, a new way of generating robot behavior by
representing a robot's visuomotor policy as a conditional denoising diffusion process. We …
representing a robot's visuomotor policy as a conditional denoising diffusion process. We …
Code as policies: Language model programs for embodied control
Large language models (LLMs) trained on code-completion have been shown to be capable
of synthesizing simple Python programs from docstrings [1]. We find that these code-writing …
of synthesizing simple Python programs from docstrings [1]. We find that these code-writing …
Perceiver-actor: A multi-task transformer for robotic manipulation
Transformers have revolutionized vision and natural language processing with their ability to
scale with large datasets. But in robotic manipulation, data is both limited and expensive …
scale with large datasets. But in robotic manipulation, data is both limited and expensive …
Offline reinforcement learning with implicit q-learning
Offline reinforcement learning requires reconciling two conflicting aims: learning a policy that
improves over the behavior policy that collected the dataset, while at the same time …
improves over the behavior policy that collected the dataset, while at the same time …
Scaling up and distilling down: Language-guided robot skill acquisition
We present a framework for robot skill acquisition, which 1) efficiently scale up data
generation of language-labelled robot data and 2) effectively distills this data down into a …
generation of language-labelled robot data and 2) effectively distills this data down into a …
Principled reinforcement learning with human feedback from pairwise or k-wise comparisons
We provide a theoretical framework for Reinforcement Learning with Human Feedback
(RLHF). We show that when the underlying true reward is linear, under both Bradley-Terry …
(RLHF). We show that when the underlying true reward is linear, under both Bradley-Terry …
Learning fine-grained bimanual manipulation with low-cost hardware
Fine manipulation tasks, such as threading cable ties or slotting a battery, are notoriously
difficult for robots because they require precision, careful coordination of contact forces, and …
difficult for robots because they require precision, careful coordination of contact forces, and …
On neural differential equations
P Kidger - arxiv preprint arxiv:2202.02435, 2022 - arxiv.org
The conjoining of dynamical systems and deep learning has become a topic of great
interest. In particular, neural differential equations (NDEs) demonstrate that neural networks …
interest. In particular, neural differential equations (NDEs) demonstrate that neural networks …
Interactive language: Talking to robots in real time
We present a framework for building interactive, real-time, natural language-instructable
robots in the real world, and we open source related assets (dataset, environment …
robots in the real world, and we open source related assets (dataset, environment …