Deep learning-based object pose estimation: A comprehensive survey

J Liu, W Sun, H Yang, Z Zeng, C Liu, J Zheng… - arxiv preprint arxiv …, 2024 - arxiv.org
Object pose estimation is a fundamental computer vision problem with broad applications in
augmented reality and robotics. Over the past decade, deep learning models, due to their …

Challenges for monocular 6d object pose estimation in robotics

D Bauer, P Hönig, JB Weibel… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Object pose estimation is a core perception task that enables, for example, object
manipulation and scene understanding. The widely available, inexpensive, and high …

Sg-bot: Object rearrangement via coarse-to-fine robotic imagination on scene graphs

G Zhai, X Cai, D Huang, Y Di… - … on Robotics and …, 2024 - ieeexplore.ieee.org
Object rearrangement is pivotal in robotic-environment interactions, representing a
significant capability in embodied AI. In this paper, we present SG-Bot, a novel …

Handal: A dataset of real-world manipulable object categories with pose annotations, affordances, and reconstructions

A Guo, B Wen, J Yuan, J Tremblay… - 2023 IEEE/RSJ …, 2023 - ieeexplore.ieee.org
We present the HANDAL dataset for category-level object pose estimation and affordance
prediction. Unlike previous datasets, ours is focused on robotics-ready manipulable objects …

Deformable 3d gaussian splatting for animatable human avatars

HJ Jung, N Brasch, J Song, E Perez-Pellitero… - arxiv preprint arxiv …, 2023 - arxiv.org
Recent advances in neural radiance fields enable novel view synthesis of photo-realistic
images in dynamic settings, which can be applied to scenarios with human animation …

Secondpose: Se (3)-consistent dual-stream feature fusion for category-level pose estimation

Y Chen, Y Di, G Zhai, F Manhardt… - Proceedings of the …, 2024 - openaccess.thecvf.com
Category-level object pose estimation aiming to predict the 6D pose and 3D size of objects
from known categories typically struggles with large intra-class shape variation. Existing …

[HTML][HTML] Indoor synthetic data generation: A systematic review

H Schieber, KC Demir, C Kleinbeck, SH Yang… - Computer Vision and …, 2024 - Elsevier
Objective: Deep learning-based object recognition, 6D pose estimation, and semantic scene
understanding require a large amount of training data to achieve generalization. Time …

Introducing HOT3D: An Egocentric Dataset for 3D Hand and Object Tracking

P Banerjee, S Shkodrani, P Moulon, S Hampali… - arxiv preprint arxiv …, 2024 - arxiv.org
We introduce HOT3D, a publicly available dataset for egocentric hand and object tracking in
3D. The dataset offers over 833 minutes (more than 3.7 M images) of multi-view …

Mh6d: multi-hypothesis consistency learning for category-level 6-d object pose estimation

J Liu, W Sun, C Liu, H Yang, X Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Six-degree-of-freedom (6DoF) object pose estimation is a crucial task for virtual reality and
accurate robotic manipulation. Category-level 6DoF pose estimation has recently become …

Zero123-6d: Zero-shot novel view synthesis for rgb category-level 6d pose estimation

F Di Felice, A Remus, S Gasperini… - 2024 IEEE/RSJ …, 2024 - ieeexplore.ieee.org
Estimating the pose of objects through vision is essential to make robotic platforms interact
with the environment. Yet, it presents many challenges, often related to the lack of flexibility …