3D object detection for autonomous driving: A comprehensive survey
Autonomous driving, in recent years, has been receiving increasing attention for its potential
to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving …
Unsupervised point cloud representation learning with deep neural networks: A survey
Point cloud data have been widely explored due to its superior accuracy and robustness
under various adverse situations. Meanwhile, deep neural networks (DNNs) have achieved …
ConvNeXt V2: Co-designing and scaling ConvNets with masked autoencoders
Driven by improved architectures and better representation learning frameworks, the field of
visual recognition has enjoyed rapid modernization and performance boost in the early …
Point-BERT: Pre-training 3D point cloud transformers with masked point modeling
We present Point-BERT, a novel paradigm for learning Transformers to generalize the
concept of BERT onto 3D point cloud. Following BERT, we devise a Masked Point Modeling …
Masked autoencoders for point cloud self-supervised learning
As a promising scheme of self-supervised learning, masked autoencoding has significantly
advanced natural language processing and computer vision. Inspired by this, we propose a …
Dense contrastive learning for self-supervised visual pre-training
To date, most existing self-supervised learning methods are designed and optimized for
image classification. These pre-trained models can be sub-optimal for dense prediction …
CrossPoint: Self-supervised cross-modal contrastive learning for 3D point cloud understanding
M Afham, I Dissanayake… - Proceedings of the …, 2022 - openaccess.thecvf.com
Manual annotation of large-scale point cloud dataset for varying tasks such as 3D object
classification, segmentation and detection is often laborious owing to the irregular structure …
Point-M2AE: Multi-scale masked autoencoders for hierarchical point cloud pre-training
Masked Autoencoders (MAE) have shown great potentials in self-supervised pre-training for
language and 2D image transformers. However, it still remains an open question on how to …
CLIP2Scene: Towards label-efficient 3D scene understanding by CLIP
Contrastive Language-Image Pre-training (CLIP) achieves promising results in 2D
zero-shot and few-shot learning. Despite the impressive performance in 2D, applying CLIP …
Point Transformer V3: Simpler Faster Stronger
This paper is not motivated to seek innovation within the attention mechanism. Instead it
focuses on overcoming the existing trade-offs between accuracy and efficiency within the …