RoboCook: Long-horizon elasto-plastic object manipulation with diverse tools
Humans excel in complex long-horizon soft body manipulation tasks via flexible tool use:
bread baking requires a knife to slice the dough and a rolling pin to flatten it. Often regarded …
D3Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Robotic Manipulation
Scene representation has been a crucial design choice in robotic manipulation systems. An
ideal representation should be 3D, dynamic, and semantic to meet the demands of diverse …
See, hear, and feel: Smart sensory fusion for robotic manipulation
Humans use all of their senses to accomplish different tasks in everyday activities. In
contrast, existing work on robotic manipulation mostly relies on one, or occasionally two …
Model-based control with sparse neural dynamics
Learning predictive models from observations using deep neural networks (DNNs) is a
promising new approach to many real-world planning and control problems. However …
Planning with spatial-temporal abstraction from point clouds for deformable object manipulation
Effective planning of long-horizon deformable object manipulation requires suitable
abstractions at both the spatial and temporal levels. Previous methods typically either focus …
Understanding World or Predicting Future? A Comprehensive Survey of World Models
The concept of world models has garnered significant attention due to advancements in
multimodal large language models such as GPT-4 and video generation models such as …
DoughNet: A visual predictive model for topological manipulation of deformable objects
Manipulation of elastoplastic objects like dough often involves topological changes such as
splitting and merging. The ability to accurately predict these topological changes that a …
DiffVL: scaling up soft body manipulation using vision-language driven differentiable physics
Combining gradient-based trajectory optimization with differentiable physics simulation is an
efficient technique for solving soft-body manipulation problems. Using a well-crafted …
D3Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement
Scene representation is a crucial design choice in robotic manipulation systems. An ideal
representation is expected to be 3D, dynamic, and semantic to meet the demands of diverse …
Dynamic-resolution model learning for object pile manipulation
Dynamics models learned from visual observations have been shown to be effective in various
robotic manipulation tasks. One of the key questions for learning such dynamics models is …