“Where am I?” Scene Retrieval with Language

J Chen, D Barath, I Armeni, M Pollefeys… - European Conference on …, 2024‏ - Springer
Natural language interfaces to embodied AI are becoming more ubiquitous in our daily lives.
This opens up further opportunities for language-based interaction with embodied agents …

SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs

Y Miao, F Engelmann, O Vysotska, F Tombari… - … on Computer Vision, 2024‏ - Springer
We introduce the task of localizing an input image within a multi-modal reference map
represented by a collection of 3D scene graphs. These scene graphs comprise multiple …