Palm-e: An embodied multimodal language model
Large language models excel at a wide range of complex tasks. However, enabling general
inference in the real world, eg, for robotics problems, raises the challenge of grounding. We …
inference in the real world, eg, for robotics problems, raises the challenge of grounding. We …
Text2motion: From natural language instructions to feasible plans
Abstract We propose Text2Motion, a language-based planning framework enabling robots
to solve sequential manipulation tasks that require long-horizon reasoning. Given a natural …
to solve sequential manipulation tasks that require long-horizon reasoning. Given a natural …
A survey of optimization-based task and motion planning: From classical to learning approaches
Task and motion planning (TAMP) integrates high-level task planning and low-level motion
planning to equip robots with the autonomy to effectively reason over long-horizon, dynamic …
planning to equip robots with the autonomy to effectively reason over long-horizon, dynamic …
Learning multi-object dynamics with compositional neural radiance fields
We present a method to learn compositional multi-object dynamics models from image
observations based on implicit object encoders, Neural Radiance Fields (NeRFs), and …
observations based on implicit object encoders, Neural Radiance Fields (NeRFs), and …
Learning neuro-symbolic skills for bilevel planning
Decision-making is challenging in robotics environments with continuous object-centric
states, continuous actions, long horizons, and sparse feedback. Hierarchical approaches …
states, continuous actions, long horizons, and sparse feedback. Hierarchical approaches …
Decomposing 3d scenes into objects via unsupervised volume segmentation
We present ObSuRF, a method which turns a single image of a scene into a 3D model
represented as a set of Neural Radiance Fields (NeRFs), with each NeRF corresponding to …
represented as a set of Neural Radiance Fields (NeRFs), with each NeRF corresponding to …
Hierarchical planning for long-horizon manipulation with geometric and symbolic scene graphs
We present a visually grounded hierarchical planning algorithm for long-horizon
manipulation tasks. Our algorithm offers a joint framework of neuro-symbolic task planning …
manipulation tasks. Our algorithm offers a joint framework of neuro-symbolic task planning …
Ifor: Iterative flow minimization for robotic object rearrangement
Accurate object rearrangement from vision is a crucial problem for a wide variety of real-
world robotics applications in unstructured environments. We propose IFOR, Iterative Flow …
world robotics applications in unstructured environments. We propose IFOR, Iterative Flow …
Large language models for chemistry robotics
This paper proposes an approach to automate chemistry experiments using robots by
translating natural language instructions into robot-executable plans, using large language …
translating natural language instructions into robot-executable plans, using large language …
Learning symbolic operators for task and motion planning
Robotic planning problems in hybrid state and action spaces can be solved by integrated
task and motion planners (TAMP) that handle the complex interaction between motion-level …
task and motion planners (TAMP) that handle the complex interaction between motion-level …