Unsupervised point cloud representation learning with deep neural networks: A survey

A **ao, J Huang, D Guan, X Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Point cloud data have been widely explored due to its superior accuracy and robustness
under various adverse situations. Meanwhile, deep neural networks (DNNs) have achieved …

Robo3d: Towards robust and reliable 3d perception against corruptions

L Kong, Y Liu, X Li, R Chen, W Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
The robustness of 3D perception systems under natural corruptions from environments and
sensors is pivotal for safety-critical applications. Existing large-scale 3D perception datasets …

Flatformer: Flattened window attention for efficient point cloud transformer

Z Liu, X Yang, H Tang, S Yang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Transformer, as an alternative to CNN, has been proven effective in many modalities (eg,
texts and images). For 3D point cloud transformers, existing efforts focus primarily on …

FusionPlanner: A multi-task motion planner for mining trucks via multi-sensor fusion

S Teng, L Li, Y Li, X Hu, L Li, Y Ai, L Chen - Mechanical Systems and Signal …, 2024 - Elsevier
In recent years, significant achievements have been made in motion planning for intelligent
vehicles. However, as a typical unstructured environment, open-pit mining attracts limited …

Enable deep learning on mobile devices: Methods, systems, and applications

H Cai, J Lin, Y Lin, Z Liu, H Tang, H Wang… - ACM Transactions on …, 2022 - dl.acm.org
Deep neural networks (DNNs) have achieved unprecedented success in the field of artificial
intelligence (AI), including computer vision, natural language processing, and speech …

Distrifusion: Distributed parallel inference for high-resolution diffusion models

M Li, T Cai, J Cao, Q Zhang, H Cai… - Proceedings of the …, 2024 - openaccess.thecvf.com
Diffusion models have achieved great success in synthesizing high-quality images.
However generating high-resolution images with diffusion models is still challenging due to …

Sparsevit: Revisiting activation sparsity for efficient high-resolution vision transformer

X Chen, Z Liu, H Tang, L Yi… - Proceedings of the …, 2023 - openaccess.thecvf.com
High-resolution images enable neural networks to learn richer visual representations.
However, this improved performance comes at the cost of growing computational …

SceneScript: Reconstructing Scenes with an Autoregressive Structured Language Model

A Avetisyan, C **e, H Howard-Jenkins, TY Yang… - … on Computer Vision, 2024 - Springer
We introduce SceneScript, a method that directly produces full scene models as a sequence
of structured language commands using an autoregressive, token-based approach. Our …

Sparse tensor-based multiscale representation for point cloud geometry compression

J Wang, D Ding, Z Li, X Feng, C Cao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
This study develops a unified Point Cloud Geometry (PCG) compression method through the
processing of multiscale sparse tensor-based voxelized PCG. We call this compression …

Link: Linear kernel for lidar-based 3d perception

T Lu, X Ding, H Liu, G Wu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Extending the success of 2D Large Kernel to 3D perception is challenging due to: 1. the
cubically-increasing overhead in processing 3D data; 2. the optimization difficulties from …