Semantic image segmentation: Two decades of research

G Csurka, R Volpi, B Chidlovskii - Foundations and Trends® …, 2022 - nowpublishers.com
Semantic image segmentation (SiS) plays a fundamental role in a broad variety of computer
vision applications, providing key information for the global understanding of an image. This …

Deep learning-based depth estimation methods from monocular image and videos: A comprehensive survey

U Rajapaksha, F Sohel, H Laga, D Diepeveen… - ACM Computing …, 2024 - dl.acm.org
Estimating depth from single RGB images and videos is of widespread interest due to its
applications in many areas, including autonomous driving, 3D reconstruction, digital …

[HTML][HTML] GCNDepth: Self-supervised monocular depth estimation based on graph convolutional network

A Masoumian, HA Rashwan, S Abdulwahab… - Neurocomputing, 2023 - Elsevier
Depth estimation is a challenging task of 3D reconstruction to enhance the accuracy sensing
of environment awareness. This work brings a new solution with improvements, which …

Aggregating global features into local vision transformer

K Patel, AM Bur, F Li, G Wang - 2022 26th International …, 2022 - ieeexplore.ieee.org
Local Transformer-based classification models have recently achieved promising results
with relatively low computational costs. However, the effect of aggregating spatial global …

[PDF][PDF] Real Objects Understanding Using 3D Haptic Virtual Reality for E-Learning Education.

SA Chelloug, H Ashfaq, SA Alsuhibany… - … , Materials & Continua, 2023 - researchgate.net
In the past two decades, there has been a lot of work on computer vision technology that
incorporates many tasks which implement basic filtering to image classification. The major …

Bridgenet: A joint learning network of depth map super-resolution and monocular depth estimation

Q Tang, R Cong, R Sheng, L He, D Zhang… - Proceedings of the 29th …, 2021 - dl.acm.org
Depth map super-resolution is a task with high practical application requirements in the
industry. Existing color-guided depth map super-resolution methods usually necessitate an …

Few-shot learning by integrating spatial and frequency representation

X Chen, G Wang - 2021 18th Conference on Robots and Vision …, 2021 - ieeexplore.ieee.org
Human beings can recognize new objects with only a few labeled examples, however, few-
shot learning remains a challenging problem for machine learning systems. Most previous …

Joint depth prediction and semantic segmentation with multi-view sam

M Shvets, D Zhao, M Niethammer… - Proceedings of the …, 2024 - openaccess.thecvf.com
Multi-task approaches to joint depth and segmentation prediction are well-studied for
monocular images. Yet, predictions from a single-view are inherently limited, while multiple …

Efficient multi-task uncertainties for joint semantic segmentation and monocular depth estimation

S Landgraf, M Hillemann, T Kapler, M Ulrich - arxiv preprint arxiv …, 2024 - arxiv.org
Quantifying the predictive uncertainty emerged as a possible solution to common challenges
like overconfidence or lack of explainability and robustness of deep neural networks, albeit …

An unsupervised domain adaptation model based on dual-module adversarial training

Y Yang, T Zhang, G Li, T Kim, G Wang - Neurocomputing, 2022 - Elsevier
In this paper, we propose a dual-module network architecture that employs a domain
discriminative feature module to encourage the domain invariant feature module to learn …