ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
This paper presents ShapeLLM, the first 3D Multimodal Large Language Model (LLM)
designed for embodied interaction, exploring a universal 3D object understanding with 3D …
4D Contrastive Superflows are Dense 3D Representation Learners
In the realm of autonomous driving, accurate 3D perception is the foundation. However,
developing such models relies on extensive human annotations–a process that is both …
Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling
Hands are the main medium when people interact with the world. Generating proper 3D
motion for hand-object interaction is vital for applications such as virtual reality and robotics …
EqvAfford: SE(3) Equivariance for Point-Level Affordance Learning
Humans perceive and interact with the world with the awareness of equivariance, facilitating
us in manipulating different objects in diverse poses. For robotic manipulation, such …
MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models
Point cloud videos effectively capture real-world spatial geometries and temporal dynamics,
which are essential for enabling intelligent agents to understand the dynamically changing …