Few-shot semantic segmentation: a review on recent approaches

Z Chang, Y Lu, X Ran, X Gao, X Wang - Neural Computing and …, 2023 - Springer
Few-shot semantic segmentation (FSS) is a challenging task that aims to learn to segment
novel categories with only a few labeled images, and it has a wide range of real-world …

Improving graph representation for point cloud segmentation via attentive filtering

N Zhang, Z Pan, TH Li, W Gao… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Recently, self-attention networks achieve impressive performance in point cloud
segmentation due to their superiority in modeling long-range dependencies. However …

Retro-fpn: Retrospective feature pyramid network for point cloud semantic segmentation

P **ang, X Wen, YS Liu, H Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Learning per-point semantic features from the hierarchical feature pyramid is essential for
point cloud semantic segmentation. However, most previous methods suffered from …

JSNet++: Dynamic filters and pointwise correlation for 3D point cloud instance and semantic segmentation

L Zhao, W Tao - IEEE Transactions on Circuits and Systems for …, 2022 - ieeexplore.ieee.org
In this paper, we propose a novel joint instance and semantic segmentation approach,
called JSNet++, to address the instance and semantic segmentation tasks of 3D point clouds …

Rethinking Few-shot 3D Point Cloud Semantic Segmentation

Z An, G Sun, Y Liu, F Liu, Z Wu… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper revisits few-shot 3D point cloud semantic segmentation (FS-PCS) with a focus on
two significant issues in the state-of-the-art: foreground leakage and sparse point …

[HTML][HTML] Small but mighty: Enhancing 3d point clouds semantic segmentation with u-next framework

Z Zeng, Q Hu, Z **e, B Li, J Zhou, Y Xu - International Journal of Applied …, 2025 - Elsevier
We investigate the problem of 3D point clouds semantic segmentation. Recently, a large
amount of research work has focused on local feature aggregation. However, the …

A multi-scale self-supervised hypergraph contrastive learning framework for video question answering

Z Wang, B Wu, K Ota, M Dong, H Li - Neural Networks, 2023 - Elsevier
Video question answering (VideoQA) is a challenging video understanding task that
requires a comprehensive understanding of multimodal information and accurate answers to …

A joint deep learning network of point clouds and multiple views for roadside object classification from lidar point clouds

L Fang, Z You, G Shen, Y Chen, J Li - ISPRS Journal of Photogrammetry …, 2022 - Elsevier
Urban management and survey departments have begun investigating the feasibility of
acquiring data from various laser scanning systems for urban infrastructure measurements …

Cross time-frequency transformer for temporal action localization

J Yang, P Wei, N Zheng - … on Circuits and Systems for Video …, 2023 - ieeexplore.ieee.org
Most modern approaches in temporal action localization (TAL) mainly focus on time domain
information, while neglecting the advantages of information from other domains. How to …

Classification of Typical Static Objects in Road Scenes Based on LO-Net

Y Li, J Wu, H Liu, J Ren, Z Xu, J Zhang, Z Wang - Remote Sensing, 2024 - mdpi.com
Mobile LiDAR technology is a powerful tool that accurately captures spatial information
about typical static objects in road scenes. However, the precise extraction and classification …