Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe

H Li, C Sima, J Dai, W Wang, L Lu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending
and drawing extensive attention both from industry and academia. Conventional …

Bevfusion: Multi-task multi-sensor fusion with unified bird's-eye view representation

Z Liu, H Tang, A Amini, X Yang, H Mao… - … on robotics and …, 2023 - ieeexplore.ieee.org
Multi-sensor fusion is essential for an accurate and reliable autonomous driving system.
Recent approaches are based on point-level fusion: augmenting the LiDAR point cloud with …

Theseus: A library for differentiable nonlinear optimization

L Pineda, T Fan, M Monge… - Advances in …, 2022 - proceedings.neurips.cc
We present Theseus, an efficient application-agnostic open source library for differentiable
nonlinear least squares (DNLS) optimization built on PyTorch, providing a common …

Challenges for monocular 6d object pose estimation in robotics

D Bauer, P Hönig, JB Weibel… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Object pose estimation is a core perception task that enables, for example, object
manipulation and scene understanding. The widely available, inexpensive, and high …

Pf-lrm: Pose-free large reconstruction model for joint pose and shape prediction

P Wang, H Tan, S Bi, Y Xu, F Luan, K Sunkavalli… - arxiv preprint arxiv …, 2023 - arxiv.org
We propose a Pose-Free Large Reconstruction Model (PF-LRM) for reconstructing a 3D
object from a few unposed images even with little visual overlap, while simultaneously …

Cape: Camera view position embedding for multi-view 3d object detection

K **ong, S Gong, X Ye, X Tan, J Wan… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we address the problem of detecting 3D objects from multi-view images.
Current query-based methods rely on global 3D position embeddings (PE) to learn the …

Object pose estimation with statistical guarantees: Conformal keypoint detection and geometric uncertainty propagation

H Yang, M Pavone - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
The two-stage object pose estimation paradigm first detects semantic keypoints on the
image and then estimates the 6D pose by minimizing reprojection errors. Despite performing …

Learning analytical posterior probability for human mesh recovery

Q Fang, K Chen, Y Fan, Q Shuai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Despite various probabilistic methods for modeling the uncertainty and ambiguity in human
mesh recovery, their overall precision is limited because existing formulations for joint …

Differentiable registration of images and lidar point clouds with voxelpoint-to-pixel matching

J Zhou, B Ma, W Zhang, Y Fang… - Advances in Neural …, 2024 - proceedings.neurips.cc
Cross-modality registration between 2D images captured by cameras and 3D point clouds
from LiDARs is a crucial task in computer vision and robotic. Previous methods estimate 2D …

Shape-constraint recurrent flow for 6d object pose estimation

Y Hai, R Song, J Li, Y Hu - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Most recent 6D object pose estimation methods rely on 2D optical flow networks to refine
their results. However, these optical flow methods typically do not consider any 3D shape …