Unsupervised point cloud representation learning with deep neural networks: A survey

A **ao, J Huang, D Guan, X Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Point cloud data have been widely explored due to its superior accuracy and robustness
under various adverse situations. Meanwhile, deep neural networks (DNNs) have achieved …

Segment any point cloud sequences by distilling vision foundation models

Y Liu, L Kong, J Cen, R Chen… - Advances in …, 2023 - proceedings.neurips.cc
Recent advancements in vision foundation models (VFMs) have opened up new
possibilities for versatile and efficient visual perception. In this work, we introduce Seal, a …

Ponderv2: Pave the way for 3d foundation model with a universal pre-training paradigm

H Zhu, H Yang, X Wu, D Huang, S Zhang, X He… - arxiv preprint arxiv …, 2023 - arxiv.org
In contrast to numerous NLP and 2D computer vision foundational models, the learning of a
robust and highly generalized 3D foundational model poses considerably greater …

Ponder: Point cloud pre-training via neural rendering

D Huang, S Peng, T He, H Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose a novel approach to self-supervised learning of point cloud representations by
differentiable neural rendering. Motivated by the fact that informative point cloud features …

Visual atoms: Pre-training vision transformers with sinusoidal waves

S Takashima, R Hayamizu, N Inoue… - Proceedings of the …, 2023 - openaccess.thecvf.com
Formula-driven supervised learning (FDSL) has been shown to be an effective method for
pre-training vision transformers, where ExFractalDB-21k was shown to exceed the pre …

Fac: 3d representation learning via foreground aware feature contrast

K Liu, A **ao, X Zhang, S Lu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Contrastive learning has recently demonstrated great potential for unsupervised pre-training
in 3D scene understanding tasks. However, most existing work randomly selects point …

Segrcdb: Semantic segmentation via formula-driven supervised learning

R Shinoda, R Hayamizu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Pre-training is a strong strategy for enhancing visual models to efficiently train them with a
limited number of labeled images. In semantic segmentation, creating annotation masks …

Self-supervised learning for pre-training 3d point clouds: A survey

B Fei, W Yang, L Liu, T Luo, R Zhang, Y Li… - arxiv preprint arxiv …, 2023 - arxiv.org
Point cloud data has been extensively studied due to its compact form and flexibility in
representing complex 3D structures. The ability of point cloud data to accurately capture and …

PlantSegNet: 3D point cloud instance segmentation of nearby plant organs with identical semantics

A Zarei, B Li, JC Schnable, E Lyons, D Pauli… - … and Electronics in …, 2024 - Elsevier
In this study, we introduce PlantSegNet, a novel neural network model for instance
segmentation of nearby objects with similar geometric structures. Our work addresses the …

Joint representation learning for text and 3d point cloud

R Huang, X Pan, H Zheng, H Jiang, Z **e, C Wu… - Pattern Recognition, 2024 - Elsevier
Recent advancements in vision-language pre-training (eg, CLIP) have enabled 2D vision
models to benefit from language supervision. However, the joint representation learning of …