Palm-e: An embodied multimodal language model

D Driess, F **a, MSM Sajjadi, C Lynch… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models excel at a wide range of complex tasks. However, enabling general
inference in the real world, eg, for robotics problems, raises the challenge of grounding. We …

Text2motion: From natural language instructions to feasible plans

K Lin, C Agia, T Migimatsu, M Pavone, J Bohg - Autonomous Robots, 2023 - Springer
Abstract We propose Text2Motion, a language-based planning framework enabling robots
to solve sequential manipulation tasks that require long-horizon reasoning. Given a natural …

A survey of optimization-based task and motion planning: From classical to learning approaches

Z Zhao, S Cheng, Y Ding, Z Zhou… - IEEE/ASME …, 2024 - ieeexplore.ieee.org
Task and motion planning (TAMP) integrates high-level task planning and low-level motion
planning to equip robots with the autonomy to effectively reason over long-horizon, dynamic …

Learning multi-object dynamics with compositional neural radiance fields

D Driess, Z Huang, Y Li, R Tedrake… - Conference on robot …, 2023 - proceedings.mlr.press
We present a method to learn compositional multi-object dynamics models from image
observations based on implicit object encoders, Neural Radiance Fields (NeRFs), and …

Learning neuro-symbolic skills for bilevel planning

T Silver, A Athalye, JB Tenenbaum… - arxiv preprint arxiv …, 2022 - arxiv.org
Decision-making is challenging in robotics environments with continuous object-centric
states, continuous actions, long horizons, and sparse feedback. Hierarchical approaches …

Decomposing 3d scenes into objects via unsupervised volume segmentation

K Stelzner, K Kersting, AR Kosiorek - arxiv preprint arxiv:2104.01148, 2021 - arxiv.org
We present ObSuRF, a method which turns a single image of a scene into a 3D model
represented as a set of Neural Radiance Fields (NeRFs), with each NeRF corresponding to …

Hierarchical planning for long-horizon manipulation with geometric and symbolic scene graphs

Y Zhu, J Tremblay, S Birchfield… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
We present a visually grounded hierarchical planning algorithm for long-horizon
manipulation tasks. Our algorithm offers a joint framework of neuro-symbolic task planning …

Ifor: Iterative flow minimization for robotic object rearrangement

A Goyal, A Mousavian, C Paxton… - Proceedings of the …, 2022 - openaccess.thecvf.com
Accurate object rearrangement from vision is a crucial problem for a wide variety of real-
world robotics applications in unstructured environments. We propose IFOR, Iterative Flow …

Large language models for chemistry robotics

N Yoshikawa, M Skreta, K Darvish… - Autonomous …, 2023 - Springer
This paper proposes an approach to automate chemistry experiments using robots by
translating natural language instructions into robot-executable plans, using large language …

Learning symbolic operators for task and motion planning

T Silver, R Chitnis, J Tenenbaum… - 2021 IEEE/RSJ …, 2021 - ieeexplore.ieee.org
Robotic planning problems in hybrid state and action spaces can be solved by integrated
task and motion planners (TAMP) that handle the complex interaction between motion-level …