Binsformer: Revisiting adaptive bins for monocular depth estimation

Z Li, X Wang, X Liu, J Jiang - IEEE Transactions on Image …, 2024 - ieeexplore.ieee.org
Monocular depth estimation (MDE) is a fundamental task in computer vision and has drawn
increasing attention. Recently, some methods reformulate it as a classification-regression …

Fully sparse fusion for 3d object detection

Y Li, L Fan, Y Liu, Z Huang, Y Chen… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Currently prevalent multi-modal 3D detection methods rely on dense detectors that usually
use dense Bird's-Eye-View (BEV) feature maps. However, the cost of such BEV feature …

Generalized predictive model for autonomous driving

J Yang, S Gao, Y Qiu, L Chen, T Li… - Proceedings of the …, 2024 - openaccess.thecvf.com
In this paper we introduce the first large-scale video prediction model in the autonomous
driving discipline. To eliminate the restriction of high-cost data collection and empower the …

Towards domain generalization for multi-view 3d object detection in bird-eye-view

S Wang, X Zhao, HM Xu, Z Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Multi-view 3D object detection (MV3D-Det) in Bird-Eye-View (BEV) has drawn extensive
attention due to its low cost and high efficiency. Although new algorithms for camera-only 3D …

PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation

Z Li, SF Bhat, P Wonka - European Conference on Computer Vision, 2024 - Springer
This paper introduces PatchRefiner, an advanced framework for metric single image depth
estimation aimed at high-resolution real-domain inputs. While depth estimation is crucial for …

Geometry-guided domain generalization for monocular 3d object detection

F Yang, H Chen, Y He, S Zhao, C Zhang, K Ni… - Proceedings of the …, 2024 - ojs.aaai.org
Monocular 3D object detection (M3OD) is important for autonomous driving. However,
existing deep learning-based methods easily suffer from performance degradation in real …

Learning from noisy data for semi-supervised 3d object detection

Z Chen, Z Li, S Wang, D Fu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Pseudo-Labeling (PL) is a critical approach in semi-supervised 3D object detection (SSOD).
In PL, delicately selected pseudo-labels, generated by the teacher model, are provided for …

Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection

G Chang, J Lee, D Kim, J Kim, D Lee… - Advances in …, 2025 - proceedings.neurips.cc
Recent advances in 3D object detection leveraging multi-view cameras have demonstrated
their practical and economical value in various challenging vision tasks. However, typical …

Monocular 3D object detection for construction scene analysis

J Shen, L Jiao, C Zhang, K Peng - Computer‐Aided Civil and …, 2024 - Wiley Online Library
Abstract Three‐dimensional (3D) object detection, that is, localizing and classifying all
critical objects in a 3D space, is essential for downstream construction scene analysis tasks …

Ms3d: Leveraging multiple detectors for unsupervised domain adaptation in 3d object detection

D Tsai, JS Berrio, M Shan, E Nebot… - 2023 IEEE 26th …, 2023 - ieeexplore.ieee.org
We introduce Multi-Source 3D (MS3D), a new self-training pipeline for unsupervised domain
adaptation in 3D object detection. Despite the remarkable accuracy of 3D detectors, they …