DDF-HO: hand-held object reconstruction via conditional directed distance field

C Zhang, Y Di, R Zhang, G Zhai… - Advances in …, 2024 - proceedings.neurips.cc
Reconstructing hand-held objects from a single RGB image is an important and challenging
problem. Existing works utilizing Signed Distance Fields (SDF) reveal limitations in …

Rethinking inductive biases for surface normal estimation

G Bae, AJ Davison - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Despite the growing demand for accurate surface normal estimation models existing
methods use general-purpose dense prediction models adopting the same inductive biases …

Sg-bot: Object rearrangement via coarse-to-fine robotic imagination on scene graphs

G Zhai, X Cai, D Huang, Y Di… - … on Robotics and …, 2024 - ieeexplore.ieee.org
Object rearrangement is pivotal in robotic-environment interactions, representing a
significant capability in embodied AI. In this paper, we present SG-Bot, a novel …

HouseCat6D-A Large-Scale Multi-Modal Category Level 6D Object Perception Dataset with Household Objects in Realistic Scenarios

HJ Jung, SC Wu, P Ruhkamp, G Zhai… - Proceedings of the …, 2024 - openaccess.thecvf.com
Estimating 6D object poses is a major challenge in 3D computer vision. Building on
successful instance-level approaches research is shifting towards category-level pose …

Ipcc-tp: Utilizing incremental pearson correlation coefficient for joint multi-agent trajectory prediction

D Zhu, G Zhai, Y Di, F Manhardt… - Proceedings of the …, 2023 - openaccess.thecvf.com
Reliable multi-agent trajectory prediction is crucial for the safe planning and control of
autonomous systems. Compared with single-agent cases, the major challenge in …

Secondpose: Se (3)-consistent dual-stream feature fusion for category-level pose estimation

Y Chen, Y Di, G Zhai, F Manhardt… - Proceedings of the …, 2024 - openaccess.thecvf.com
Category-level object pose estimation aiming to predict the 6D pose and 3D size of objects
from known categories typically struggles with large intra-class shape variation. Existing …

Opa-3d: Occlusion-aware pixel-wise aggregation for monocular 3d object detection

Y Su, Y Di, G Zhai, F Manhardt… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org
Monocular 3D object detection has recently made a significant leap forward thanks to the
use of pre-trained depth estimators for pseudo-LiDAR recovery. Yet, such two-stage …

An economic framework for 6-dof grasp detection

XM Wu, JF Cai, JJ Jiang, D Zheng, YL Wei… - … on Computer Vision, 2024 - Springer
Robotic gras** in clutters is a fundamental task in robotic manipulation. In this work, we
propose an economic framework for 6-DoF grasp detection, aiming to economize the …

Moho: Learning single-view hand-held object reconstruction with multi-view occlusion-aware supervision

C Zhang, G Jiao, Y Di, G Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Previous works concerning single-view hand-held object reconstruction typically rely on
supervision from 3D ground-truth models which are hard to collect in real world. In contrast …

Hipose: Hierarchical binary surface encoding and correspondence pruning for rgb-d 6dof object pose estimation

Y Lin, Y Su, P Nathan, S Inuganti, Y Di… - Proceedings of the …, 2024 - openaccess.thecvf.com
In this work we present a novel dense-correspondence method for 6DoF object pose
estimation from a single RGB-D image. While many existing data-driven methods achieve …