- Academic Search

W Ju, Z Fang, Y Gu, Z Liu, Q Long, Z Qiao, Y Qin… - Neural Networks, 2024 - Elsevier

Graph representation learning aims to effectively encode high-dimensional sparse graph-
structured data into low-dimensional dense vectors, which is a fundamental task that has …

Salva Cita Citato da 168 Articoli correlati Tutte e 6 le versioni

[Free GPT-4]

[PDF] arxiv.org

Aligning cyber space with physical world: A comprehensive survey on embodied ai

Y Liu, W Chen, Y Bai, X Liang, G Li, W Gao… - ar** embodied agents. In …

Salva Cita Citato da 42 Articoli correlati Tutte e 2 le versioni

[Free GPT-4]

[PDF] thecvf.com

Multi3drefer: Grounding text description to multiple 3d objects

Y Zhang, ZM Gong, AX Chang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

We introduce the task of localizing a flexible number of objects in real-world 3D scenes
using natural language descriptions. Existing 3D visual grounding tasks focus on localizing …

Salva Cita Citato da 60 Articoli correlati Tutte e 7 le versioni Ricerca biblioteche Versione HTML

[Free GPT-4]

[PDF] thecvf.com

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding Reasoning and Planning

S Chen, X Chen, C Zhang, M Li, G Yu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract Recent progress in Large Multimodal Models (LMM) has opened up great
possibilities for various applications in the field of human-machine interactions. However …

Salva Cita Citato da 61 Articoli correlati Tutte e 3 le versioni Versione HTML

[Free GPT-4]

[PDF] thecvf.com

3djcg: A unified framework for joint dense captioning and visual grounding on 3d point clouds

D Cai, L Zhao, J Zhang, L Sheng… - Proceedings of the …, 2022 - openaccess.thecvf.com

Observing that the 3D captioning task and the 3D grounding task contain both shared and
complementary information in nature, in this work, we propose a unified framework to jointly …

Salva Cita Citato da 109 Articoli correlati Tutte e 3 le versioni Versione HTML

[Free GPT-4]

[PDF] thecvf.com

Eda: Explicit text-decoupling and dense alignment for 3d visual grounding

Y Wu, X Cheng, R Zhang, Z Cheng… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract 3D visual grounding aims to find the object within point clouds mentioned by free-
form natural language descriptions with rich semantic cues. However, existing methods …

Salva Cita Citato da 85 Articoli correlati Tutte e 5 le versioni Versione HTML

[Free GPT-4]

[PDF] thecvf.com

Multi-view transformer for 3d visual grounding

S Huang, Y Chen, J Jia, L Wang - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

The 3D visual grounding task aims to ground a natural language description to the targeted
object in a 3D scene, which is usually represented in 3D point clouds. Previous works …

Salva Cita Citato da 115 Articoli correlati Tutte e 5 le versioni Versione HTML

Crea avviso

Cita

Ricerca avanzata

Salvato in La mia biblioteca

Transrefer3d: Entity-and-relation aware transformer for fine-grained 3d visual grounding

A comprehensive survey on deep graph representation learning

Aligning cyber space with physical world: A comprehensive survey on embodied ai

Multi3drefer: Grounding text description to multiple 3d objects

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding Reasoning and Planning

3djcg: A unified framework for joint dense captioning and visual grounding on 3d point clouds

Eda: Explicit text-decoupling and dense alignment for 3d visual grounding

Multi-view transformer for 3d visual grounding