Real-world robot applications of foundation models: A review
Recent developments in foundation models, like Large Language Models (LLMs) and Vision-
Language Models (VLMs), trained on extensive data, facilitate flexible application across …
Language Models (VLMs), trained on extensive data, facilitate flexible application across …
Large language models for robotics: Opportunities, challenges, and perspectives
Large language models (LLMs) have undergone significant expansion and have been
increasingly integrated across various domains. Notably, in the realm of robot task planning …
increasingly integrated across various domains. Notably, in the realm of robot task planning …
Open x-embodiment: Robotic learning datasets and rt-x models
Large, high-capacity models trained on diverse datasets have shown remarkable successes
on efficiently tackling downstream applications. In domains from NLP to Computer Vision …
on efficiently tackling downstream applications. In domains from NLP to Computer Vision …
Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration0
Large, high-capacity models trained on diverse datasets have shown remarkable successes
on efficiently tackling downstream applications. In domains from NLP to Computer Vision …
on efficiently tackling downstream applications. In domains from NLP to Computer Vision …
Gello: A general, low-cost, and intuitive teleoperation framework for robot manipulators
Humans can teleoperate robots to accomplish complex manipulation tasks. Imitation
learning has emerged as a powerful framework that leverages human teleoperated …
learning has emerged as a powerful framework that leverages human teleoperated …
Aloha unleashed: A simple recipe for robot dexterity
Recent work has shown promising results for learning end-to-end robot policies using
imitation learning. In this work we address the question of how far can we push imitation …
imitation learning. In this work we address the question of how far can we push imitation …
Fmb: a functional manipulation benchmark for generalizable robotic learning
In this paper, we propose a real-world benchmark for studying robotic learning in the context
of functional manipulation: a robot needs to accomplish complex long-horizon behaviors by …
of functional manipulation: a robot needs to accomplish complex long-horizon behaviors by …
Rh20t: A comprehensive robotic dataset for learning diverse skills in one-shot
A key challenge for robotic manipulation in open domains is how to acquire diverse and
generalizable skills for robots. Recent progress in one-shot imitation learning and robotic …
generalizable skills for robots. Recent progress in one-shot imitation learning and robotic …
Umi on legs: Making manipulation policies mobile with manipulation-centric whole-body controllers
We introduce UMI-on-Legs, a new framework that combines real-world and simulation data
for quadruped manipulation systems. We scale task-centric data collection in the real world …
for quadruped manipulation systems. We scale task-centric data collection in the real world …
Generative camera dolly: Extreme monocular dynamic novel view synthesis
Accurate reconstruction of complex dynamic scenes from just a single viewpoint continues to
be a challenging task in computer vision. Current dynamic novel view synthesis methods …
be a challenging task in computer vision. Current dynamic novel view synthesis methods …