On the synergies between machine learning and binocular stereo for depth estimation from images: A survey

M Poggi, F Tosi, K Batsos, P Mordohai… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Stereo matching is one of the longest-standing problems in computer vision with close to 40
years of studies and research. Throughout the years the paradigm has shifted from local …

Deep learning-based depth estimation methods from monocular image and videos: A comprehensive survey

U Rajapaksha, F Sohel, H Laga, D Diepeveen… - ACM Computing …, 2024 - dl.acm.org
Estimating depth from single RGB images and videos is of widespread interest due to its
applications in many areas, including autonomous driving, 3D reconstruction, digital …

Self-supervised monocular depth estimation with multiscale perception

Y Zhang, M Gong, J Li, M Zhang… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Extracting 3D information from a single optical image is very attractive. Recently emerging
self-supervised methods can learn depth representations without using ground truth depth …

Unsupervised monocular depth estimation using attention and multi-warp reconstruction

C Ling, X Zhang, H Chen - IEEE Transactions on Multimedia, 2021 - ieeexplore.ieee.org
Monocular depth estimation has become one of the most studied topics in computer vision.
Most approaches treat depth prediction as a fully supervised regression problem requiring …

DPNet: Detail-preserving network for high quality monocular depth estimation

X Ye, S Chen, R Xu - Pattern Recognition, 2021 - Elsevier
Existing monocular depth estimation methods are unsatisfactory due to the inaccurate
inference of depth details and the loss of spatial information. In this paper, we present a …

Unsupervised monocular depth estimation via recursive stereo distillation

X Ye, X Fan, M Zhang, R Xu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Existing unsupervised monocular depth estimation methods resort to stereo image pairs
instead of ground-truth depth maps as supervision to predict scene depth. Constrained by …

Probabilistic graph attention network with conditional kernels for pixel-wise prediction

D Xu, X Alameda-Pineda, W Ouyang… - … on Pattern Analysis …, 2020 - ieeexplore.ieee.org
Multi-scale representations deeply learned via convolutional neural networks have shown
tremendous importance for various pixel-level prediction problems. In this paper we present …

Laplacian pyramid neural network for dense continuous-value regression for complex scenes

X Chen, X Chen, Y Zhang, X Fu… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Many computer vision tasks, such as monocular depth estimation and height estimation from
a satellite orthophoto, have a common underlying goal, which is regression of dense …

Improving 2D object detection with binocular images for outdoor surveillance

F Chu, Y Pang, J Cao, J Nie, X Li - Neurocomputing, 2022 - Elsevier
Detecting objects and providing their 2D information (eg, size and center) are crucial for
outdoor visual surveillance. Because the cameras are static and their distances to …