Delivering arbitrary-modal semantic segmentation

J Zhang, R Liu, H Shi, K Yang, S Reiß… - Proceedings of the …, 2023 - openaccess.thecvf.com
Multimodal fusion can make semantic segmentation more robust. However, fusing an
arbitrary number of modalities remains underexplored. To delve into this problem, we create …

Chatbridge: Bridging modalities with large language model as a language catalyst

Z Zhao, L Guo, T Yue, S Chen, S Shao, X Zhu… - arxiv preprint arxiv …, 2023 - arxiv.org
Building general-purpose models that can perceive diverse real-world modalities and solve
various tasks is an appealing target in artificial intelligence. In this paper, we present …

Deep learning based 3D segmentation: A survey

Y He, H Yu, X Liu, Z Yang, W Sun, S Anwar… - arxiv preprint arxiv …, 2021 - arxiv.org
3D segmentation is a fundamental and challenging problem in computer vision with
applications in autonomous driving and robotics. It has received significant attention from the …

An efficient RGB-D indoor scene-parsing solution via lightweight multiflow intersection and knowledge distillation

W Zhou, Y Zhang, W Yan, L Ye - IEEE Journal of Selected …, 2024 - ieeexplore.ieee.org
The rapid progression of convolutional neural networks (CNNs) has significantly improved
indoor scene parsing, transforming the fields of robotics, autonomous navigation …

Cross-modal attention fusion network for RGB-D semantic segmentation

Q Zhao, Y Wan, J Xu, L Fang - Neurocomputing, 2023 - Elsevier
RGB-D semantic segmentation is crucial for robots to understand scenes. Most existing
methods take depth information as an additional input, leading to cross-modal semantic …

Object segmentation by mining cross-modal semantics

Z Wu, J Wang, Z Zhou, Z An, Q Jiang… - Proceedings of the 31st …, 2023 - dl.acm.org
Multi-sensor clues have shown promise for object segmentation, but inherent noise in each
sensor, as well as the calibration error in practice, may bias the segmentation accuracy. In …

Dual-modal non-local context guided multi-stage fusion for indoor RGB-D semantic segmentation

X Guo, W Ma, F Liang, Q Mi - Expert Systems with Applications, 2024 - Elsevier
Complementarily fusing RGB and depth images while effectively suppressing task-irrelevant
noise is crucial for achieving accurate indoor RGB-D semantic segmentation. In this paper …

[HTML][HTML] A Transformer-based multi-modal fusion network for semantic segmentation of high-resolution remote sensing imagery

Y Liu, K Gao, H Wang, Z Yang, P Wang, S Ji… - International Journal of …, 2024 - Elsevier
Semantic segmentation of high-resolution multispectral remote sensing image has been
intensely studied. However, the shadow occlusions, or the similar color and textures …

Deep learning based 3D segmentation in computer vision: A survey

Y He, H Yu, X Liu, Z Yang, W Sun, S Anwar, A Mian - Information Fusion, 2025 - Elsevier
Abstract 3D segmentation is a fundamental and challenging problem in computer vision with
applications in autonomous driving and robotics. It has received significant attention from the …

Cross-modal transformer for RGB-D semantic segmentation of production workshop objects

Q Ru, G Chen, T Zuo, X Liao - Pattern Recognition, 2023 - Elsevier
Scene understanding in a production workshop is an important technology to improve its
intelligence level, semantic segmentation of production workshop objects is an effective …