A survey of embodied AI: From simulators to research tasks

J Duan, S Yu, HL Tan, H Zhu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
There has been an emerging paradigm shift from the era of “internet AI” to “embodied AI,”
where AI algorithms and agents no longer learn from datasets of images, videos or text …

RT-1: Robotics transformer for real-world control at scale

A Brohan, N Brown, J Carbajal, Y Chebotar… - arXiv preprint arXiv …, 2022 - arxiv.org
By transferring knowledge from large, diverse, task-agnostic datasets, modern machine
learning models can solve specific downstream tasks either zero-shot or with small task …

Q-Transformer: Scalable offline reinforcement learning via autoregressive Q-functions

Y Chebotar, Q Vuong, K Hausman… - … on Robot Learning, 2023 - proceedings.mlr.press
In this work, we present a scalable reinforcement learning method for training multi-task
policies from large offline datasets that can leverage both human demonstrations and …

Navigating to objects in the real world

T Gervet, S Chintala, D Batra, J Malik, DS Chaplot - Science Robotics, 2023 - science.org
Semantic navigation is necessary to deploy mobile robots in uncontrolled environments
such as homes or hospitals. Many learning-based approaches have been proposed in …

MOKA: Open-vocabulary robotic manipulation through mark-based visual prompting

F Liu, K Fang, P Abbeel, S Levine - First Workshop on Vision …, 2024 - openreview.net
Open-vocabulary generalization requires robotic systems to perform tasks involving complex
and diverse environments and task goals. While the recent advances in vision language …

History aware multimodal transformer for vision-and-language navigation

S Chen, PL Guhur, C Schmid… - Advances in neural …, 2021 - proceedings.neurips.cc
Vision-and-language navigation (VLN) aims to build autonomous visual agents that follow
instructions and navigate in real scenes. To remember previously visited locations and …

Open-vocabulary queryable scene representations for real world planning

B Chen, F Xia, B Ichter, K Rao… - … on Robotics and …, 2023 - ieeexplore.ieee.org
Large language models (LLMs) have unlocked new capabilities of task planning from
human instructions. However, prior attempts to apply LLMs to real-world robotic tasks are …

Think global, act local: Dual-scale graph transformer for vision-and-language navigation

S Chen, PL Guhur, M Tapaswi… - Proceedings of the …, 2022 - openaccess.thecvf.com
Following language instructions to navigate in unseen environments is a challenging
problem for autonomous embodied agents. The agent not only needs to ground languages …

Spatio-temporal graph transformer networks for pedestrian trajectory prediction

C Yu, X Ma, J Ren, H Zhao, S Yi - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer
Understanding crowd motion dynamics is critical to real-world applications, e.g., surveillance
systems and autonomous driving. This is challenging because it requires effectively …

PONI: Potential functions for ObjectGoal navigation with interaction-free learning

SK Ramakrishnan, DS Chaplot… - Proceedings of the …, 2022 - openaccess.thecvf.com
State-of-the-art approaches to ObjectGoal navigation (ObjectNav) rely on reinforcement
learning and typically require significant computational resources and time for learning. We …