ViNT: A foundation model for visual navigation

D Shah, A Sridhar, N Dashora, K Stachowicz… - arxiv …

Socially Compliant Automated Vehicles: State of the Art, Experts Expectations, and A Conceptual Framework

Y Dong, B van Arem, H Farah - arxiv preprint arxiv:2501.06089, 2025 - arxiv.org
Automated Vehicles (AVs) hold promise for revolutionizing transportation by improving road
safety, traffic efficiency, and overall mobility. Despite the steady advancement in high-level …

CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos

X Liu, J Li, Y Jiang, N Sujay, Z Yang, J Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
Navigating dynamic urban environments presents significant challenges for embodied
agents, requiring advanced spatial reasoning and adherence to common-sense norms …

Socially aware robot navigation through scoring using vision-language models

D Song, J Liang, A Payandeh, X Xiao… - arxiv preprint arxiv …, 2024 - arxiv.org
We propose VLM-Social-Nav, a novel Vision-Language Model (VLM) based navigation
approach to compute a robot's trajectory in human-centered environments. Our goal is to …

OLiVia-Nav: An online lifelong vision language approach for mobile robot social navigation

S Narasimhan, AH Tan, D Choi, G Nejat - arxiv preprint arxiv:2409.13675, 2024 - arxiv.org
Service robots in human-centered environments such as hospitals, office buildings, and
long-term care homes need to navigate while adhering to social norms to ensure the safety and …