Manigaussian: Dynamic gaussian splatting for multi-task robotic manipulation
Performing language-conditioned robotic manipulation tasks in unstructured environments
is highly demanded for general intelligent robots. Conventional robotic manipulation …
is highly demanded for general intelligent robots. Conventional robotic manipulation …
Bunny-visionpro: Real-time bimanual dexterous teleoperation for imitation learning
Teleoperation is a crucial tool for collecting human demonstrations, but controlling robots
with bimanual dexterous hands remains a challenge. Existing teleoperation systems …
with bimanual dexterous hands remains a challenge. Existing teleoperation systems …
Affordancellm: Grounding affordance from vision language models
Affordance grounding refers to the task of finding the area of an object with which one can
interact. It is a fundamental but challenging task as a successful solution requires the …
interact. It is a fundamental but challenging task as a successful solution requires the …
Manipulate-anything: Automating real-world robots using vision-language models
Large-scale endeavors like and widespread community efforts such as Open-X-Embodiment
have contributed to growing the scale of robot demonstration data. However, there is still an …
have contributed to growing the scale of robot demonstration data. However, there is still an …
Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation
Abstract This paper introduces Hierarchical Diffusion Policy (HDP) a hierarchical agent for
multi-task robotic manipulation. HDP factorises a manipulation policy into a hierarchical …
multi-task robotic manipulation. HDP factorises a manipulation policy into a hierarchical …
READ: Retrieval-Enhanced Asymmetric Diffusion for Motion Planning
Abstract This paper proposes Retrieval-Enhanced Asymmetric Diffusion (READ) for image-
based robot motion planning. Given an image of the scene READ retrieves an initial motion …
based robot motion planning. Given an image of the scene READ retrieves an initial motion …
Deep generative models in robotics: A survey on learning from multimodal demonstrations
Learning from Demonstrations, the field that proposes to learn robot behavior models from
data, is gaining popularity with the emergence of deep generative models. Although the …
data, is gaining popularity with the emergence of deep generative models. Although the …
Learning to manipulate anywhere: A visual generalizable framework for reinforcement learning
Can we endow visuomotor robots with generalization capabilities to operate in diverse open-
world scenarios? In this paper, we propose\textbf {Maniwhere}, a generalizable framework …
world scenarios? In this paper, we propose\textbf {Maniwhere}, a generalizable framework …
3d diffuser actor: Policy diffusion with 3d scene representations
We marry diffusion policies and 3D scene representations for robot manipulation. Diffusion
policies learn the action distribution conditioned on the robot and environment state using …
policies learn the action distribution conditioned on the robot and environment state using …
Fourier transporter: Bi-equivariant robotic manipulation in 3d
Many complex robotic manipulation tasks can be decomposed as a sequence of pick and
place actions. Training a robotic agent to learn this sequence over many different starting …
place actions. Training a robotic agent to learn this sequence over many different starting …