Transformer fusion and pixel-level contrastive learning for RGB-D salient object detection

J Wu, F Hao, W Liang, J Xu - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Current RGB-D salient object detection (RGB-D SOD) methods mainly develop a
generalizable model trained by binary cross-entropy (BCE) loss based on convolutional or …

Real-time monocular human depth estimation and segmentation on embedded systems

S An, F Zhou, M Yang, H Zhu, C Fu… - 2021 IEEE/RSJ …, 2021 - ieeexplore.ieee.org
Estimating a scene's depth to achieve collision avoidance against moving pedestrians is a
crucial and fundamental problem in the robotic field. This paper proposes a novel, low …

Relational conduction graph network for intelligent fault diagnosis of rotating machines under small fault samples

Z Chen, X Wang, J Wu, C Deng… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Fault samples obtained in real-world environment are limited, which makes it hard to
diagnose faults of rotating machines (RM) accurately by using the existing intelligent …

GAM-Depth: Self-Supervised Indoor Depth Estimation Leveraging a Gradient-Aware Mask and Semantic Constraints

A Cheng, Z Yang, H Zhu, K Mao - arxiv preprint arxiv:2402.14354, 2024 - arxiv.org
Self-supervised depth estimation has evolved into an image reconstruction task that
minimizes a photometric loss. While recent methods have made strides in indoor depth …

Zero-faulty sample machinery fault detection via relation network with out-of-distribution data augmentation

Z Chen, HZ Huang, J Wu, Y Wang - Engineering Applications of Artificial …, 2025 - Elsevier
Zero-faulty sample machinery fault detection is extremely challenging. However, it is
common issue that fault samples are extremely difficult to obtain, which limits the …

Geometric Constraints in Deep Learning Frameworks: A Survey

VK Vats, DJ Crandall - arxiv preprint arxiv:2403.12431, 2024 - arxiv.org
Stereophotogrammetry is an emerging technique of scene understanding. Its origins go
back to at least the 1800s when people first started to investigate using photographs to …

What makes the unsupervised monocular depth estimation (UMDE) model training better

X Wang, B Liang, M Yang, W Li - Scientific Reports, 2022 - nature.com
Current computer vision tasks based on deep learning require a huge amount of data with
annotations for model training or testing, especially in some dense estimation tasks, such as …

Improving Monocular Depth Estimation by Semantic Pre-training

P Rottmann, T Posewsky, A Milioto… - 2021 IEEE/RSJ …, 2021 - ieeexplore.ieee.org
Knowing the distance to nearby objects is crucial for autonomous cars to navigate safely in
everyday traffic. In this paper, we investigate monocular depth estimation, which advanced …

Exploring the Impacts from Datasets to Monocular Depth Estimation (MDE) Models with MineNavi

X Wang, B Liang, M Yang, W Li - arxiv preprint arxiv:2008.08454, 2020 - arxiv.org
Current computer vision tasks based on deep learning require a huge amount of data with
annotations for model training or testing, especially in some dense estimation tasks, such as …