Aligning cyber space with physical world: A comprehensive survey on embodied ai

Y Liu, W Chen, Y Bai, X Liang, G Li, W Gao… - arxiv preprint arxiv …, 2024‏ - arxiv.org
Embodied Artificial Intelligence (Embodied AI) is crucial for achieving Artificial General
Intelligence (AGI) and serves as a foundation for various applications that bridge cyberspace …

Pointmamba: A simple state space model for point cloud analysis

D Liang, X Zhou, W Xu, X Zhu, Z Zou… - Advances in neural …, 2025‏ - proceedings.neurips.cc
Transformers have become one of the foundational architectures in point cloud analysis
tasks due to their excellent global modeling ability. However, the attention mechanism has …

Oa-cnns: Omni-adaptive sparse cnns for 3d semantic segmentation

B Peng, X Wu, L Jiang, Y Chen… - Proceedings of the …, 2024‏ - openaccess.thecvf.com
The booming of 3D recognition in the 2020s began with the introduction of point cloud
transformers. They quickly overwhelmed sparse CNNs and became state-of-the-art models …

Unlearnable 3D point clouds: Class-wise transformation is all you need

X Wang, M Li, W Liu, H Zhang, S Hu… - Advances in …, 2025‏ - proceedings.neurips.cc
Traditional unlearnable strategies have been proposed to prevent unauthorized users from
training on the 2D image data. With more 3D point cloud data containing sensitivity …

Voxel mamba: Group-free state space models for point cloud based 3d object detection

G Zhang, L Fan, C He, Z Lei… - Advances in Neural …, 2025‏ - proceedings.neurips.cc
Serialization-based methods, which serialize the 3D voxels and group them into multiple
sequences before inputting to Transformers, have demonstrated their effectiveness in 3D …

Point mamba: A novel point cloud backbone based on state space model with octree-based ordering strategy

J Liu, R Yu, Y Wang, Y Zheng, T Deng, W Ye… - arxiv preprint arxiv …, 2024‏ - arxiv.org
Recently, state space model (SSM) has gained great attention due to its promising
performance, linear complexity, and long sequence modeling ability in both language and …

Groupcontrast: Semantic-aware self-supervised representation learning for 3d understanding

C Wang, L Jiang, X Wu, Z Tian… - Proceedings of the …, 2024‏ - openaccess.thecvf.com
Self-supervised 3D representation learning aims to learn effective representations from
large-scale unlabeled point clouds. Most existing approaches adopt point discrimination as …

Large spatial model: End-to-end unposed images to semantic 3d

Z Fan, J Zhang, W Cong, P Wang, R Li… - Advances in …, 2025‏ - proceedings.neurips.cc
Reconstructing and understanding 3D structures from a limited number of images is a
classical problem in computer vision. Traditional approaches typically decompose this task …

Deep learning based 3D segmentation: A survey

Y He, H Yu, X Liu, Z Yang, W Sun, S Anwar… - arxiv preprint arxiv …, 2021‏ - arxiv.org
3D segmentation is a fundamental and challenging problem in computer vision with
applications in autonomous driving and robotics. It has received significant attention from the …

Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model

X Han, Y Tang, Z Wang, X Li - … of the 32nd ACM International Conference …, 2024‏ - dl.acm.org
Existing Transformer-based models for point cloud analysis suffer from quadratic complexity,
leading to compromised point cloud resolution and information loss. In contrast, the newly …