From imitation to refinement–residual rl for precise visual assembly
Recent advances in behavior cloning (BC), like action-chunking and diffusion, have led to
impressive progress. Still, imitation alone remains insufficient for tasks requiring reliable and …
impressive progress. Still, imitation alone remains insufficient for tasks requiring reliable and …
Good Data Is All Imitation Learning Needs
In this paper, we address the limitations of traditional teacher-student models, imitation
learning, and behaviour cloning in the context of Autonomous/Automated Driving Systems …
learning, and behaviour cloning in the context of Autonomous/Automated Driving Systems …
Auginsert: Learning robust visual-force policies via data augmentation for object assembly tasks
This paper primarily focuses on learning robust visual-force policies in the context of high-
precision object assembly tasks. Specifically, we focus on the contact phase of the assembly …
precision object assembly tasks. Specifically, we focus on the contact phase of the assembly …
From Imitation to Refinement--Residual RL for Precise Assembly
Recent advances in behavior cloning (BC), like action-chunking and diffusion, have led to
impressive progress. Still, imitation alone remains insufficient for tasks requiring reliable and …
impressive progress. Still, imitation alone remains insufficient for tasks requiring reliable and …
Learning from Demonstration with Implicit Nonlinear Dynamics Models
Learning from Demonstration (LfD) is a useful paradigm for training policies that solve tasks
involving complex motions, such as those encountered in robotic manipulation. In practice …
involving complex motions, such as those encountered in robotic manipulation. In practice …
Sample-Efficient Behavior Cloning Using General Domain Knowledge
Behavior cloning has shown success in many sequential decision-making tasks by learning
from expert demonstrations, yet they can be very sample inefficient and fail to generalize to …
from expert demonstrations, yet they can be very sample inefficient and fail to generalize to …