Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe

H Li, C Sima, J Dai, W Wang, L Lu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending
and drawing extensive attention both from industry and academia. Conventional …

Advancing 3D point cloud understanding through deep transfer learning: A comprehensive survey

SS Sohail, Y Himeur, H Kheddar, A Amira, F Fadli… - Information …, 2024 - Elsevier
The 3D point cloud (3DPC) has significantly evolved and benefited from the advance of
deep learning (DL). However, the latter faces various issues, including the lack of data or …

Point transformer v3: Simpler faster stronger

X Wu, L Jiang, PS Wang, Z Liu, X Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper is not motivated to seek innovation within the attention mechanism. Instead it
focuses on overcoming the existing trade-offs between accuracy and efficiency within the …

Spherical transformer for lidar-based 3d recognition

X Lai, Y Chen, F Lu, J Liu, J Jia - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
LiDAR-based 3D point cloud recognition has benefited various applications. Without
specially considering the LiDAR point distribution, most current methods suffer from …

Rethinking range view representation for lidar segmentation

L Kong, Y Liu, R Chen, Y Ma, X Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
LiDAR segmentation is crucial for autonomous driving perception. Recent trends favor point-
or voxel-based methods as they often yield better performance than the traditional range …

Clip2scene: Towards label-efficient 3d scene understanding by clip

R Chen, Y Liu, L Kong, X Zhu, Y Ma… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Contrastive Language-Image Pre-training (CLIP) achieves promising results in 2D
zero-shot and few-shot learning. Despite the impressive performance in 2D, applying CLIP …

Robo3d: Towards robust and reliable 3d perception against corruptions

L Kong, Y Liu, X Li, R Chen, W Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
The robustness of 3D perception systems under natural corruptions from environments and
sensors is pivotal for safety-critical applications. Existing large-scale 3D perception datasets …

Learning 3d representations from 2d pre-trained models via image-to-point masked autoencoders

R Zhang, L Wang, Y Qiao, P Gao… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Pre-training by numerous image data has become de-facto for robust 2D representations. In
contrast, due to the expensive data processing, a paucity of 3D datasets severely hinders …

Delivering arbitrary-modal semantic segmentation

J Zhang, R Liu, H Shi, K Yang, S Reiß… - Proceedings of the …, 2023 - openaccess.thecvf.com
Multimodal fusion can make semantic segmentation more robust. However, fusing an
arbitrary number of modalities remains underexplored. To delve into this problem, we create …

Scpnet: Semantic scene completion on point cloud

Z **a, Y Liu, X Li, X Zhu, Y Ma, Y Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Training deep models for semantic scene completion is challenging due to the sparse and
incomplete input, a large quantity of objects of diverse scales as well as the inherent label …