- Academic Search

Artikel

Scholar

2 Ergebnisse (0,02 Sek.)

Mein Profil Meine Bibliothek

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations

In Artikeln mit Zitaten suchen

[Free GPT-4]

[PDF] arxiv.org

Embodiedocc: Embodied 3d occupancy prediction for vision-based online scene understanding

Y Wu, W Zheng, S Zuo, Y Huang, J Zhou… - arxiv preprint arxiv …, 2024 - arxiv.org

3D occupancy prediction provides a comprehensive description of the surrounding scenes
and has become an essential task for 3D perception. Most existing methods focus on offline …

Speichern Zitieren Zitiert von: 2 Ähnliche Artikel HTML-Version

[Free GPT-4]

[PDF] arxiv.org

ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding

AT Wang, ZM Gong, AX Chang - arxiv preprint arxiv:2501.01366, 2025 - arxiv.org

3D visual grounding (3DVG) involves localizing entities in a 3D scene referred to by natural
language text. Such models are useful for embodied AI and scene retrieval applications …

Speichern Zitieren Ähnliche Artikel Alle 2 Versionen HTML-Version

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations

Embodiedocc: Embodied 3d occupancy prediction for vision-based online scene understanding

ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding