Single image depth estimation: An overview

A Mertan, DJ Duff, G Unal - Digital Signal Processing, 2022 - Elsevier
We review solutions to the problem of depth estimation, arguably the most important subtask
in scene understanding. We focus on the single image depth estimation problem. Due to its …

Depth anything: Unleashing the power of large-scale unlabeled data

L Yang, B Kang, Z Huang, X Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract This work presents Depth Anything a highly practical solution for robust monocular
depth estimation. Without pursuing novel technical modules we aim to build a simple yet …

Zoedepth: Zero-shot transfer by combining relative and metric depth

SF Bhat, R Birkl, D Wofk, P Wonka, M Müller - arxiv preprint arxiv …, 2023 - arxiv.org
This paper tackles the problem of depth estimation from a single image. Existing work either
focuses on generalization performance disregarding metric scale, ie relative depth …

Metric3d: Towards zero-shot metric 3d prediction from a single image

W Yin, C Zhang, H Chen, Z Cai, G Yu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Reconstructing accurate 3D scenes from images is a long-standing vision task. Due to the ill-
posedness of the single-image reconstruction problem, most well-established methods are …

Vision transformers for dense prediction

R Ranftl, A Bochkovskiy… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We introduce dense prediction transformers, an architecture that leverages vision
transformers in place of convolutional networks as a backbone for dense prediction tasks …

Battle of the backbones: A large-scale comparison of pretrained models across computer vision tasks

M Goldblum, H Souri, R Ni, M Shu… - Advances in …, 2024 - proceedings.neurips.cc
Neural network based computer vision systems are typically built on a backbone, a
pretrained or randomly initialized feature extractor. Several years ago, the default option was …

P3depth: Monocular depth estimation with a piecewise planarity prior

V Patil, C Sakaridis, A Liniger… - Proceedings of the …, 2022 - openaccess.thecvf.com
Monocular depth estimation is vital for scene understanding and downstream tasks. We
focus on the supervised setup, in which ground-truth depth is available only at training time …

Boosting monocular depth estimation models to high-resolution via content-adaptive multi-resolution merging

SMH Miangoleh, S Dille, L Mai… - Proceedings of the …, 2021 - openaccess.thecvf.com
Neural networks have shown great abilities in estimating depth from a single image.
However, the inferred depth maps are well below one-megapixel resolution and often lack …

Learning to recover 3d scene shape from a single image

W Yin, J Zhang, O Wang, S Niklaus… - Proceedings of the …, 2021 - openaccess.thecvf.com
Despite significant progress in monocular depth estimation in the wild, recent state-of-the-art
methods cannot be used to recover accurate 3D scene shape due to an unknown depth shift …

Depth pro: Sharp monocular metric depth in less than a second

A Bochkovskii, A Delaunoy, H Germain… - arxiv preprint arxiv …, 2024 - arxiv.org
We present a foundation model for zero-shot metric monocular depth estimation. Our model,
Depth Pro, synthesizes high-resolution depth maps with unparalleled sharpness and high …