Deep learning approaches to grasp synthesis: A review
Gras** is the process of picking up an object by applying forces and torques at a set of
contacts. Recent advances in deep learning methods have allowed rapid progress in robotic …
contacts. Recent advances in deep learning methods have allowed rapid progress in robotic …
A survey on learning-based robotic gras**
Abstract Purpose of Review This review provides a comprehensive overview of machine
learning approaches for vision-based robotic gras** and manipulation. Current trends and …
learning approaches for vision-based robotic gras** and manipulation. Current trends and …
Rt-2: Vision-language-action models transfer web knowledge to robotic control
A Brohan, N Brown, J Carbajal, Y Chebotar… - arxiv preprint arxiv …, 2023 - arxiv.org
We study how vision-language models trained on Internet-scale data can be incorporated
directly into end-to-end robotic control to boost generalization and enable emergent …
directly into end-to-end robotic control to boost generalization and enable emergent …
Open x-embodiment: Robotic learning datasets and rt-x models
Large, high-capacity models trained on diverse datasets have shown remarkable successes
on efficiently tackling downstream applications. In domains from NLP to Computer Vision …
on efficiently tackling downstream applications. In domains from NLP to Computer Vision …
[HTML][HTML] Rt-2: Vision-language-action models transfer web knowledge to robotic control
We study how vision-language models trained on Internet-scale data can be incorporated
directly into end-to-end robotic control to boost generalization and enable emergent …
directly into end-to-end robotic control to boost generalization and enable emergent …
Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration0
Large, high-capacity models trained on diverse datasets have shown remarkable successes
on efficiently tackling downstream applications. In domains from NLP to Computer Vision …
on efficiently tackling downstream applications. In domains from NLP to Computer Vision …
On the opportunities and risks of foundation models
AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …
Navigating to objects in the real world
Semantic navigation is necessary to deploy mobile robots in uncontrolled environments
such as homes or hospitals. Many learning-based approaches have been proposed in …
such as homes or hospitals. Many learning-based approaches have been proposed in …
Bc-z: Zero-shot task generalization with robotic imitation learning
In this paper, we study the problem of enabling a vision-based robotic manipulation system
to generalize to novel tasks, a long-standing challenge in robot learning. We approach the …
to generalize to novel tasks, a long-standing challenge in robot learning. We approach the …
Moka: Open-vocabulary robotic manipulation through mark-based visual prompting
Open-vocabulary generalization requires robotic systems to perform tasks involving complex
and diverse environments and task goals. While the recent advances in vision language …
and diverse environments and task goals. While the recent advances in vision language …