Google 학술 검색

Y Wu, X Cheng, R Zhang, Z Cheng… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract 3D visual grounding aims to find the object within point clouds mentioned by free-
form natural language descriptions with rich semantic cues. However, existing methods …

저장 인용 85회 인용 관련 학술자료 전체 5개의 버전 HTML 버전

[Free GPT-4]

[PDF] arxiv.org

Film: Following instructions in language with modular methods

SY Min, DS Chaplot, P Ravikumar, Y Bisk… - ar** through instruction following

M Ding, Y Xu, Z Chen, DD Cox, P Luo… - … on robot learning, 2023 - proceedings.mlr.press

Humans, even at a very early age, can learn visual concepts and understand geometry and
layout through active interaction with the environment, and generalize their compositions to …

저장 인용 19회 인용 관련 학술자료 전체 5개의 버전 HTML 버전

[Free GPT-4]

[PDF] thecvf.com

Episodic memory question answering

S Datta, S Dharur, V Cartillier, R Desai… - Proceedings of the …, 2022 - openaccess.thecvf.com

Egocentric augmented reality devices such as wearable glasses passively capture visual
data as a human wearer tours a home environment. We envision a scenario wherein the …

저장 인용 38회 인용 관련 학술자료 전체 5개의 버전 HTML 버전

[Free GPT-4]

[PDF] arxiv.org

Learning 3d dynamic scene representations for robot manipulation

Z Xu, Z He, J Wu, S Song - arxiv preprint arxiv:2011.01968, 2020 - arxiv.org

3D scene representation for robot manipulation should capture three key object properties:
permanency--objects that become occluded over time continue to exist; amodal …

저장 인용 58회 인용 관련 학술자료 전체 5개의 버전 HTML 버전

Four ways to improve verbo-visual fusion for dense 3d visual grounding

O Unal, C Sakaridis, S Saha, L Van Gool - European Conference on …, 2024 - Springer

Abstract 3D visual grounding is the task of localizing the object in a 3D scene which is
referred by a description in natural language. With a wide range of applications ranging from …

저장 인용 3회 인용 관련 학술자료 전체 3개의 버전

Visual language navigation: A survey and open challenges

SM Park, YG Kim - Artificial Intelligence Review, 2023 - Springer

With the recent development of deep learning, AI models are widely used in various
domains. AI models show good performance for definite tasks such as image classification …

저장 인용 32회 인용 관련 학술자료 전체 5개의 버전

[Free GPT-4]

[PDF] thecvf.com

Fast and explicit neural view synthesis

P Guo, MA Bautista, A Colburn… - Proceedings of the …, 2022 - openaccess.thecvf.com

We study the problem of novel view synthesis from sparse source observations of a scene
comprised of 3D objects. We propose a simple yet effective approach that is neither …

저장 인용 34회 인용 관련 학술자료 전체 5개의 버전 HTML 버전

[Free GPT-4]

[PDF] arxiv.org

Voxel-informed language grounding

R Corona, S Zhu, D Klein, T Darrell - arxiv preprint arxiv:2205.09710, 2022 - arxiv.org

Natural language applied to natural 2D images describes a fundamentally 3D world. We
present the Voxel-informed Language Grounder (VLG), a language grounding model that …

저장 인용 13회 인용 관련 학술자료 전체 6개의 버전 HTML 버전

[Free GPT-4]

[PDF] thecvf.com

Multi-Attribute Interactions Matter for 3D Visual Grounding

C Xu, Y Han, R Xu, L Hui, J **e… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract 3D visual grounding aims to localize 3D objects described by free-form language
sentences. Following the detection-then-matching paradigm existing methods mainly focus …

저장 인용 관련 학술자료 HTML 버전

알림 만들기

인용

고급 검색

라이브러리에 저장됨

Embodied language grounding with 3d visual feature representations

Eda: Explicit text-decoupling and dense alignment for 3d visual grounding

Film: Following instructions in language with modular methods

Episodic memory question answering

Learning 3d dynamic scene representations for robot manipulation

Four ways to improve verbo-visual fusion for dense 3d visual grounding

Visual language navigation: A survey and open challenges

Fast and explicit neural view synthesis

Voxel-informed language grounding

Multi-Attribute Interactions Matter for 3D Visual Grounding