Chatgpt for robotics: Design principles and model abilities
This paper presents an experimental study regarding the use of OpenAI's ChatGPT for
robotics applications. We outline a strategy that combines design principles for prompt …
robotics applications. We outline a strategy that combines design principles for prompt …
Foundation models in robotics: Applications, challenges, and the future
We survey applications of pretrained foundation models in robotics. Traditional deep
learning models in robotics are trained on small datasets tailored for specific tasks, which …
learning models in robotics are trained on small datasets tailored for specific tasks, which …
Moka: Open-vocabulary robotic manipulation through mark-based visual prompting
Open-vocabulary generalization requires robotic systems to perform tasks involving complex
and diverse environments and task goals. While the recent advances in vision language …
and diverse environments and task goals. While the recent advances in vision language …
Esc: Exploration with soft commonsense constraints for zero-shot object navigation
The ability to accurately locate and navigate to a specific object is a crucial capability for
embodied agents that operate in the real world and interact with objects to complete tasks …
embodied agents that operate in the real world and interact with objects to complete tasks …
Can an Embodied Agent Find Your “Cat-shaped Mug”? LLM-Based Zero-Shot Object Navigation
We present language-guided exploration (LGX), a novel algorithm for Language-Driven
Zero-Shot Object Goal Navigation (L-ZSON), where an embodied agent navigates to an …
Zero-Shot Object Goal Navigation (L-ZSON), where an embodied agent navigates to an …
Robot learning in the era of foundation models: A survey
The proliferation of Large Language Models (LLMs) has s fueled a shift in robot learning
from automation towards general embodied Artificial Intelligence (AI). Adopting foundation …
from automation towards general embodied Artificial Intelligence (AI). Adopting foundation …
Large language models as generalizable policies for embodied tasks
We show that large language models (LLMs) can be adapted to be generalizable policies
for embodied visual tasks. Our approach, called Large LAnguage model Reinforcement …
for embodied visual tasks. Our approach, called Large LAnguage model Reinforcement …