Real-world robot applications of foundation models: A review

K Kawaharazuka, T Matsushima… - Advanced …, 2024 - Taylor & Francis
Recent developments in foundation models, like Large Language Models (LLMs) and Vision-
Language Models (VLMs), trained on extensive data, facilitate flexible application across …

Rt-2: Vision-language-action models transfer web knowledge to robotic control

A Brohan, N Brown, J Carbajal, Y Chebotar… - arXiv preprint arXiv …, 2023 - arxiv.org
We study how vision-language models trained on Internet-scale data can be incorporated
directly into end-to-end robotic control to boost generalization and enable emergent …

3d-llm: Injecting the 3d world into large language models

Y Hong, H Zhen, P Chen, S Zheng… - Advances in …, 2023 - proceedings.neurips.cc
Large language models (LLMs) and Vision-Language Models (VLMs) have been shown to
excel at multiple tasks, such as commonsense reasoning. Powerful as these models can be …

Chatgpt for robotics: Design principles and model abilities

SH Vemprala, R Bonatti, A Bucker, A Kapoor - IEEE Access, 2024 - ieeexplore.ieee.org
This paper presents an experimental study regarding the use of OpenAI's ChatGPT for
robotics applications. We outline a strategy that combines design principles for prompt …

Rt-2: Vision-language-action models transfer web knowledge to robotic control

B Zitkovich, T Yu, S Xu, P Xu, T Xiao… - … on Robot Learning, 2023 - proceedings.mlr.press
We study how vision-language models trained on Internet-scale data can be incorporated
directly into end-to-end robotic control to boost generalization and enable emergent …

Your diffusion model is secretly a zero-shot classifier

AC Li, M Prabhudesai, S Duggal… - Proceedings of the …, 2023 - openaccess.thecvf.com
The recent wave of large-scale text-to-image diffusion models has dramatically increased
our text-based image generation abilities. These models can generate realistic images for a …

Conceptgraphs: Open-vocabulary 3d scene graphs for perception and planning

Q Gu, A Kuwajerwala, S Morin… - … on Robotics and …, 2024 - ieeexplore.ieee.org
For robots to perform a wide variety of tasks, they require a 3D representation of the world
that is semantically rich, yet compact and efficient for task-driven perception and planning …

Visual language maps for robot navigation

C Huang, O Mees, A Zeng… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Grounding language to the visual observations of a navigating agent can be performed
using off-the-shelf visual-language models pretrained on Internet-scale data (e.g., image …

Roboagent: Generalization and efficiency in robot manipulation via semantic augmentations and action chunking

H Bharadhwaj, J Vakil, M Sharma… - … on Robotics and …, 2024 - ieeexplore.ieee.org
The grand aim of having a single robot that can manipulate arbitrary objects in diverse
settings is at odds with the paucity of robotics datasets. Acquiring and growing such datasets …

Transfer learning in robotics: An upcoming breakthrough? A review of promises and challenges

N Jaquier, MC Welle, A Gams, K Yao… - … Journal of Robotics …, 2023 - journals.sagepub.com
Transfer learning is a conceptually enticing paradigm in pursuit of truly intelligent embodied
agents. The core concept—reusing prior knowledge to learn in and from novel situations—is …