Attention attention everywhere: Monocular depth prediction with skip attention

A Agarwal, C Arora - Proceedings of the IEEE/CVF Winter …, 2023 - openaccess.thecvf.com
Abstract Monocular Depth Estimation (MDE) aims to predict pixel-wise depth given a single
RGB image. For both, the convolutional as well as the recent attention-based models …

Adabins: Depth estimation using adaptive bins

SF Bhat, I Alhashim, P Wonka - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We address the problem of estimating a high quality dense depth map from a single RGB
input image. We start out with a baseline encoder-decoder convolutional neural network …

Binsformer: Revisiting adaptive bins for monocular depth estimation

Z Li, X Wang, X Liu, J Jiang - IEEE Transactions on Image …, 2024 - ieeexplore.ieee.org
Monocular depth estimation (MDE) is a fundamental task in computer vision and has drawn
increasing attention. Recently, some methods reformulate it as a classification-regression …

Depthformer: Exploiting long-range correlation and local information for accurate monocular depth estimation

Z Li, Z Chen, X Liu, J Jiang - Machine Intelligence Research, 2023 - Springer
This paper aims to address the problem of supervised monocular depth estimation. We start
with a meticulous pilot study to demonstrate that the long-range correlation is essential for …

P3depth: Monocular depth estimation with a piecewise planarity prior

V Patil, C Sakaridis, A Liniger… - Proceedings of the …, 2022 - openaccess.thecvf.com
Monocular depth estimation is vital for scene understanding and downstream tasks. We
focus on the supervised setup, in which ground-truth depth is available only at training time …

From big to small: Multi-scale local planar guidance for monocular depth estimation

JH Lee, MK Han, DW Ko, IH Suh - arXiv preprint arXiv:1907.10326, 2019 - arxiv.org
Estimating accurate depth from a single image is challenging because it is an ill-posed
problem as infinitely many 3D scenes can be projected to the same 2D scene. However …

Monocular depth estimation using laplacian pyramid-based depth residuals

M Song, S Lim, W Kim - … transactions on circuits and systems for …, 2021 - ieeexplore.ieee.org
With a great success of the generative model via deep neural networks, monocular depth
estimation has been actively studied by exploiting various encoder-decoder architectures …

ViP-DeepLab: Learning visual perception with depth-aware video panoptic segmentation

S Qiao, Y Zhu, H Adam, A Yuille… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this paper, we present ViP-DeepLab, a unified model attempting to tackle the long-standing
and challenging inverse projection problem in vision, which we model as restoring …

Multimodal end-to-end autonomous driving

Y Xiao, F Codevilla, A Gurram… - IEEE Transactions …, 2020 - ieeexplore.ieee.org
A crucial component of an autonomous vehicle (AV) is the artificial intelligence (AI) that is able to
drive towards a desired destination. Today, there are different paradigms addressing the …

Quadratic video interpolation

X Xu, L Siyao, W Sun, Q Yin… - Advances in Neural …, 2019 - proceedings.neurips.cc
Video interpolation is an important problem in computer vision, which helps overcome the
temporal limitation of camera sensors. Existing video interpolation methods usually assume …