Few-shot semantic segmentation: a review on recent approaches
Z Chang, Y Lu, X Ran, X Gao, X Wang - Neural Computing and …, 2023 - Springer
Few-shot semantic segmentation (FSS) is a challenging task that aims to learn to segment
novel categories with only a few labeled images, and it has a wide range of real-world …
novel categories with only a few labeled images, and it has a wide range of real-world …
Improving graph representation for point cloud segmentation via attentive filtering
Recently, self-attention networks achieve impressive performance in point cloud
segmentation due to their superiority in modeling long-range dependencies. However …
segmentation due to their superiority in modeling long-range dependencies. However …
Retro-fpn: Retrospective feature pyramid network for point cloud semantic segmentation
Learning per-point semantic features from the hierarchical feature pyramid is essential for
point cloud semantic segmentation. However, most previous methods suffered from …
point cloud semantic segmentation. However, most previous methods suffered from …
JSNet++: Dynamic filters and pointwise correlation for 3D point cloud instance and semantic segmentation
L Zhao, W Tao - IEEE Transactions on Circuits and Systems for …, 2022 - ieeexplore.ieee.org
In this paper, we propose a novel joint instance and semantic segmentation approach,
called JSNet++, to address the instance and semantic segmentation tasks of 3D point clouds …
called JSNet++, to address the instance and semantic segmentation tasks of 3D point clouds …
Rethinking Few-shot 3D Point Cloud Semantic Segmentation
This paper revisits few-shot 3D point cloud semantic segmentation (FS-PCS) with a focus on
two significant issues in the state-of-the-art: foreground leakage and sparse point …
two significant issues in the state-of-the-art: foreground leakage and sparse point …
[HTML][HTML] Small but mighty: Enhancing 3d point clouds semantic segmentation with u-next framework
We investigate the problem of 3D point clouds semantic segmentation. Recently, a large
amount of research work has focused on local feature aggregation. However, the …
amount of research work has focused on local feature aggregation. However, the …
A multi-scale self-supervised hypergraph contrastive learning framework for video question answering
Video question answering (VideoQA) is a challenging video understanding task that
requires a comprehensive understanding of multimodal information and accurate answers to …
requires a comprehensive understanding of multimodal information and accurate answers to …
A joint deep learning network of point clouds and multiple views for roadside object classification from lidar point clouds
L Fang, Z You, G Shen, Y Chen, J Li - ISPRS Journal of Photogrammetry …, 2022 - Elsevier
Urban management and survey departments have begun investigating the feasibility of
acquiring data from various laser scanning systems for urban infrastructure measurements …
acquiring data from various laser scanning systems for urban infrastructure measurements …
Cross time-frequency transformer for temporal action localization
J Yang, P Wei, N Zheng - … on Circuits and Systems for Video …, 2023 - ieeexplore.ieee.org
Most modern approaches in temporal action localization (TAL) mainly focus on time domain
information, while neglecting the advantages of information from other domains. How to …
information, while neglecting the advantages of information from other domains. How to …
Classification of Typical Static Objects in Road Scenes Based on LO-Net
Y Li, J Wu, H Liu, J Ren, Z Xu, J Zhang, Z Wang - Remote Sensing, 2024 - mdpi.com
Mobile LiDAR technology is a powerful tool that accurately captures spatial information
about typical static objects in road scenes. However, the precise extraction and classification …
about typical static objects in road scenes. However, the precise extraction and classification …