Objaverse-xl: A universe of 10m+ 3d objects

M Deitke, R Liu, M Wallingford, H Ngo… - Advances in …, 2023 - proceedings.neurips.cc
Natural language processing and 2D vision models have attained remarkable proficiency on
many tasks primarily by escalating the scale of training data. However, 3D vision tasks have …

Omniobject3d: Large-vocabulary 3d object dataset for realistic perception, reconstruction and generation

T Wu, J Zhang, X Fu, Y Wang, J Ren… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent advances in modeling 3D objects mostly rely on synthetic datasets due to the lack of
large-scale real-scanned 3D databases. To facilitate the development of 3D perception …

Gapartnet: Cross-category domain-generalizable object perception and manipulation via generalizable and actionable parts

H Geng, H Xu, C Zhao, C Xu, L Yi… - Proceedings of the …, 2023 - openaccess.thecvf.com
For years, researchers have been devoted to generalizable object perception and
manipulation, where cross-category generalizability is highly desired yet underexplored. In …

Full-body articulated human-object interaction

N Jiang, T Liu, Z Cao, J Cui, Z Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Fine-grained capture of 3D Human-Object Interactions (HOIs) boosts human activity
understanding and facilitates various downstream visual tasks. Prior models mostly assume …

Paris: Part-level reconstruction and motion analysis for articulated objects

J Liu, A Mahdavi-Amiri, M Savva - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We address the task of simultaneous part-level reconstruction and motion parameter
estimation for articulated objects. Given two sets of multi-view images of an object in two …

Carto: Category and joint agnostic reconstruction of articulated objects

N Heppert, MZ Irshad, S Zakharov… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present CARTO, a novel approach for reconstructing multiple articulated objects from a
single stereo RGB observation. We use implicit object-centric representations and learn a …

Grounding 3d object affordance from 2d interactions in images

Y Yang, W Zhai, H Luo, Y Cao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Grounding 3D object affordance seeks to locate objects'" action possibilities" regions in the
3D space, which serves as a link between perception and operation for embodied agents …

Visual-tactile sensing for in-hand object reconstruction

W Xu, Z Yu, H Xue, R Ye, S Yao… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Tactile sensing is one of the modalities human rely on heavily to perceive the world. Working
with vision, this modality refines local geometry structure, measures deformation at contact …

CAGE: controllable articulation generation

J Liu, HII Tam, A Mahdavi-Amiri… - Proceedings of the …, 2024 - openaccess.thecvf.com
We address the challenge of generating 3D articulated objects in a controllable fashion.
Currently modeling articulated 3D objects is either achieved through laborious manual …

Egochoir: Capturing 3d human-object interaction regions from egocentric views

Y Yang, W Zhai, C Wang, C Yu… - Advances in Neural …, 2025 - proceedings.neurips.cc
Understanding egocentric human-object interaction (HOI) is a fundamental aspect of human-
centric perception, facilitating applications like AR/VR and embodied AI. For the egocentric …