Real-world robot applications of foundation models: A review
Recent developments in foundation models, like Large Language Models (LLMs) and Vision-
Language Models (VLMs), trained on extensive data, facilitate flexible application across …
Aligning cyber space with physical world: A comprehensive survey on Embodied AI
Embodied Artificial Intelligence (Embodied AI) is crucial for achieving Artificial General
Intelligence (AGI) and serves as a foundation for various applications that bridge cyberspace …
Foundation models in robotics: Applications, challenges, and the future
We survey applications of pretrained foundation models in robotics. Traditional deep
learning models in robotics are trained on small datasets tailored for specific tasks, which …
OpenEQA: Embodied question answering in the era of foundation models
We present a modern formulation of Embodied Question Answering (EQA) as the task of
understanding an environment well enough to answer questions about it in natural …
OK-Robot: What really matters in integrating open-knowledge models for robotics
Remarkable progress has been made in recent years in the fields of vision, language, and
robotics. We now have vision models capable of recognizing objects based on language …
AffordanceLLM: Grounding affordance from vision language models
Affordance grounding refers to the task of finding the area of an object with which one can
interact. It is a fundamental but challenging task as a successful solution requires the …
Large language models as generalizable policies for embodied tasks
We show that large language models (LLMs) can be adapted to be generalizable policies
for embodied visual tasks. Our approach, called Large LAnguage model Reinforcement …
Prompt a robot to walk with large language models
Large language models (LLMs) pre-trained on vast internet-scale data have showcased
remarkable capabilities across diverse domains. Recently, there has been escalating …
V-IRL: Grounding Virtual Intelligence in Real Life
There is a sensory gulf between the Earth that humans inhabit and the digital realms in
which modern AI agents are created. To develop AI agents that can sense, think, and act as …
Habitat Synthetic Scenes Dataset (HSSD-200): An analysis of 3D scene scale and realism tradeoffs for ObjectGoal navigation
We contribute the Habitat Synthetic Scene Dataset, a dataset of 211 high-quality 3D
scenes and use it to test navigation agent generalization to realistic 3D environments. Our …