Deep learning for monocular depth estimation: A review

Y Ming, X Meng, C Fan, H Yu - Neurocomputing, 2021 - Elsevier
Depth estimation is a classic task in computer vision, which is of great significance for many
applications such as augmented reality, target tracking and autonomous driving. Traditional …

[HTML][HTML] Monocular depth estimation using deep learning: A review

A Masoumian, HA Rashwan, J Cristiano, MS Asif… - Sensors, 2022 - mdpi.com
In current decades, significant advancements in robotics engineering and autonomous
vehicles have improved the requirement for precise depth measurements. Depth estimation …

Repurposing diffusion-based image generators for monocular depth estimation

B Ke, A Obukhov, S Huang, N Metzger… - Proceedings of the …, 2024 - openaccess.thecvf.com
Monocular depth estimation is a fundamental computer vision task. Recovering 3D depth
from a single image is geometrically ill-posed and requires scene understanding so it is not …

Surroundocc: Multi-camera 3d occupancy prediction for autonomous driving

Y Wei, L Zhao, W Zheng, Z Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract 3D scene understanding plays a vital role in vision-based autonomous driving.
While most existing methods focus on 3D object detection, they have difficulty describing …

Learning to upsample by learning to sample

W Liu, H Lu, H Fu, Z Cao - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
We present DySample, an ultra-lightweight and effective dynamic upsampler. While
impressive performance gains have been witnessed from recent kernel-based dynamic …

Zoedepth: Zero-shot transfer by combining relative and metric depth

SF Bhat, R Birkl, D Wofk, P Wonka, M Müller - arxiv preprint arxiv …, 2023 - arxiv.org
This paper tackles the problem of depth estimation from a single image. Existing work either
focuses on generalization performance disregarding metric scale, ie relative depth …

UniDepth: Universal monocular metric depth estimation

L Piccinelli, YH Yang, C Sakaridis… - Proceedings of the …, 2024 - openaccess.thecvf.com
Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks
in 3D perception and modeling. However the remarkable accuracy of recent MMDE methods …

Unleashing text-to-image diffusion models for visual perception

W Zhao, Y Rao, Z Liu, B Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Diffusion models (DMs) have become the new trend of generative models and have
demonstrated a powerful ability of conditional synthesis. Among those, text-to-image …

Ddp: Diffusion model for dense visual prediction

Y Ji, Z Chen, E **e, L Hong, X Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose a simple, efficient, yet powerful framework for dense visual predictions based
on the conditional diffusion pipeline. Our approach follows a" noise-to-map" generative …

Neural window fully-connected crfs for monocular depth estimation

W Yuan, X Gu, Z Dai, S Zhu… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Estimating the accurate depth from a single image is challenging since it is inherently
ambiguous and ill-posed. While recent works design increasingly complicated and powerful …