Delivering arbitrary-modal semantic segmentation

J Zhang, R Liu, H Shi, K Yang, S Reiß… - Proceedings of the …, 2023 - openaccess.thecvf.com
Multimodal fusion can make semantic segmentation more robust. However, fusing an
arbitrary number of modalities remains underexplored. To delve into this problem, we create …

Explicit attention-enhanced fusion for RGB-thermal perception tasks

M Liang, J Hu, C Bao, H Feng, F Deng… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org
Recently, RGB-Thermal based perception has shown significant advances. Thermal
information provides useful clues when visual cameras suffer from poor lighting conditions …

Global contextually guided lightweight network for RGB-thermal urban scene understanding

T Gong, W Zhou, X Qian, J Lei, L Yu - Engineering Applications of Artificial …, 2023 - Elsevier
Recent achievements in scene understanding have benefited considerably from the rapid
development of convolutional neural networks. However, improvements of scene …

Dformer: Rethinking rgbd representation learning for semantic segmentation

B Yin, X Zhang, Z Li, L Liu, MM Cheng… - arxiv preprint arxiv …, 2023 - arxiv.org
We present DFormer, a novel RGB-D pretraining framework to learn transferable
representations for RGB-D segmentation tasks. DFormer has two new key innovations: 1) …

MMSMCNet: Modal memory sharing and morphological complementary networks for RGB-T urban scene semantic segmentation

W Zhou, H Zhang, W Yan, W Lin - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Combining color (RGB) images with thermal images can facilitate semantic segmentation of
poorly lit urban scenes. However, for RGB-thermal (RGB-T) semantic segmentation, most …

Graph attention guidance network with knowledge distillation for semantic segmentation of remote sensing images

W Zhou, X Fan, W Yan, S Shan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Deep learning has become a popular method for studying the semantic segmentation of
high-resolution remote sensing images (HRRSIs). Existing methods have adopted …

Multilanguage transformer for improved text to remote sensing image retrieval

MM Al Rahhal, Y Bazi, NA Alsharif… - IEEE Journal of …, 2022 - ieeexplore.ieee.org
Cross-modal text-image retrieval in remote sensing (RS) provides a flexible retrieval
experience for mining useful information from RS repositories. However, existing methods …

Position-aware relation learning for rgb-thermal salient object detection

H Zhou, C Tian, Z Zhang, C Li, Y Ding… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Salient object detection (SOD) is an important task in computer vision that aims to identify
visually conspicuous regions in images. RGB-Thermal SOD combines two spectra to …

MGSGNet-S*: Multilayer guided Semantic graph network via knowledge distillation for RGB-thermal urban scene parsing

W Zhou, H Wu, Q Jiang - IEEE Transactions on Intelligent …, 2024 - ieeexplore.ieee.org
Owing to rapid developments in driverless technologies, vision tasks for unmanned vehicles
have gained considerable attention, particularly in multimodal-based urban scene parsing …

: Edge-Aware Multimodal Transformer for RGB-D Salient Object Detection

G Chen, Q Wang, B Dong, R Ma, N Liu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
RGB-D salient object detection (SOD) has gained tremendous attention in recent years. In
particular, transformer has been employed and shown great potential. However, existing …