الباحث العلمي من Google

Y Wu, P Zhang, M Gu, J Zheng, X Bai - Information Fusion, 2024‏ - Elsevier‏

Embodied AI aims to create agents that complete complex tasks by interacting with the
environment. A key problem in this field is embodied navigation which understands multi …‏

حفظ اقتباس تم اقتباسها في عدد: 5 مقالات ذات صلة الإصدارات الـ 3كلها

[Free GPT-4]

[PDF] neurips.cc

History aware multimodal transformer for vision-and-language navigation‏

S Chen, PL Guhur, C Schmid… - Advances in neural …, 2021‏ - proceedings.neurips.cc‏

Vision-and-language navigation (VLN) aims to build autonomous visual agents that follow
instructions and navigate in real scenes. To remember previously visited locations and …‏

حفظ اقتباس تم اقتباسها في عدد: 238 مقالات ذات صلة الإصدارات الـ 8كلها إصدار HTML‏

[Free GPT-4]

[PDF] thecvf.com

Think global, act local: Dual-scale graph transformer for vision-and-language navigation‏

S Chen, PL Guhur, M Tapaswi… - Proceedings of the …, 2022‏ - openaccess.thecvf.com‏

Following language instructions to navigate in unseen environments is a challenging
problem for autonomous embodied agents. The agent not only needs to ground languages …‏

حفظ اقتباس تم اقتباسها في عدد: 158 مقالات ذات صلة الإصدارات الـ 9كلها إصدار HTML‏

[Free GPT-4]

[PDF] thecvf.com

Vln bert: A recurrent vision-and-language bert for navigation‏

Y Hong, Q Wu, Y Qi… - Proceedings of the …, 2021‏ - openaccess.thecvf.com‏

Accuracy of many visiolinguistic tasks has benefited significantly from the application of
vision-and-language (V&L) BERT. However, its application for the task of vision-and …‏

حفظ اقتباس تم اقتباسها في عدد: 289 مقالات ذات صلة الإصدارات الـ 5كلها إصدار HTML‏

[Free GPT-4]

[PDF] arxiv.org

Room-across-room: Multilingual vision-and-language navigation with dense spatiotemporal grounding‏

A Ku, P Anderson, R Patel, E Ie, J Baldridge - arxiv preprint arxiv …, 2020‏ - arxiv.org‏

We introduce Room-Across-Room (RxR), a new Vision-and-Language Navigation (VLN)
dataset. RxR is multilingual (English, Hindi, and Telugu) and larger (more paths and …‏

حفظ اقتباس تم اقتباسها في عدد: 304 مقالات ذات صلة الإصدارات الـ 6كلها إصدار HTML‏

[Free GPT-4]

[PDF] thecvf.com

Scaling data generation in vision-and-language navigation‏

Z Wang, J Li, Y Hong, Y Wang, Q Wu… - Proceedings of the …, 2023‏ - openaccess.thecvf.com‏

Recent research in language-guided visual navigation has demonstrated a significant
demand for the diversity of traversable environments and the quantity of supervision for …‏

حفظ اقتباس تم اقتباسها في عدد: 60 مقالات ذات صلة الإصدارات الـ 6كلها إصدار HTML‏

[Free GPT-4]

[PDF] arxiv.org

Vision-and-language navigation: A survey of tasks, methods, and future directions‏

J Gu, E Stefani, Q Wu, J Thomason… - arxiv preprint arxiv …, 2022‏ - arxiv.org‏

A long-term goal of AI research is to build intelligent agents that can communicate with
humans in natural language, perceive the environment, and perform real-world tasks. Vision …‏

حفظ اقتباس تم اقتباسها في عدد: 130 مقالات ذات صلة الإصدارات الـ 6كلها إصدار HTML‏

[Free GPT-4]

[PDF] jair.org Full View‏

Core challenges in embodied vision-language planning‏

J Francis, N Kitamura, F Labelle, X Lu, I Navarro… - Journal of Artificial …, 2022‏ - jair.org‏

Recent advances in the areas of multimodal machine learning and artificial intelligence (AI)
have led to the development of challenging tasks at the intersection of Computer Vision …‏

حفظ اقتباس تم اقتباسها في عدد: 51 مقالات ذات صلة الإصدارات الـ 14كلها إصدار HTML‏

[Free GPT-4]

[PDF] thecvf.com

Envedit: Environment editing for vision-and-language navigation‏

J Li, H Tan, M Bansal - … of the IEEE/CVF Conference on …, 2022‏ - openaccess.thecvf.com‏

Abstract In Vision-and-Language Navigation (VLN), an agent needs to navigate through the
environment based on natural language instructions. Due to limited available data for agent …‏

حفظ اقتباس تم اقتباسها في عدد: 88 مقالات ذات صلة الإصدارات الـ 5كلها إصدار HTML‏

[Free GPT-4]

[PDF] thecvf.com

Hop: History-and-order aware pre-training for vision-and-language navigation‏

Y Qiao, Y Qi, Y Hong, Z Yu… - Proceedings of the …, 2022‏ - openaccess.thecvf.com‏

Pre-training has been adopted in a few of recent works for Vision-and-Language Navigation
(VLN). However, previous pre-training methods for VLN either lack the ability to predict …‏

حفظ اقتباس تم اقتباسها في عدد: 88 مقالات ذات صلة الإصدارات الـ 7كلها إصدار HTML‏

إنشاء تنبيه

اقتباس

بحث متقدم

تم حفظ المقالة في مكتبتي.

General evaluation for instruction conditioned navigation using dynamic time war**

Embodied navigation with multi-modal information: A survey from tasks to methodology‏

History aware multimodal transformer for vision-and-language navigation‏

Think global, act local: Dual-scale graph transformer for vision-and-language navigation‏

Vln bert: A recurrent vision-and-language bert for navigation‏

Room-across-room: Multilingual vision-and-language navigation with dense spatiotemporal grounding‏

Scaling data generation in vision-and-language navigation‏

Vision-and-language navigation: A survey of tasks, methods, and future directions‏

Core challenges in embodied vision-language planning‏

Envedit: Environment editing for vision-and-language navigation‏

Hop: History-and-order aware pre-training for vision-and-language navigation‏