Searching efficient 3d architectures with sparse point-voxel convolution

H Tang, Z Liu, S Zhao, Y Lin, J Lin, H Wang… - European conference on …, 2020 - Springer
Self-driving cars need to understand 3D scenes efficiently and accurately in order to drive
safely. Given the limited hardware resources, existing 3D perception models are not able to …

Enable deep learning on mobile devices: Methods, systems, and applications

H Cai, J Lin, Y Lin, Z Liu, H Tang, H Wang… - ACM Transactions on …, 2022 - dl.acm.org
Deep neural networks (DNNs) have achieved unprecedented success in the field of artificial
intelligence (AI), including computer vision, natural language processing, and speech …

Sparsevit: Revisiting activation sparsity for efficient high-resolution vision transformer

X Chen, Z Liu, H Tang, L Yi… - Proceedings of the …, 2023 - openaccess.thecvf.com
High-resolution images enable neural networks to learn richer visual representations.
However, this improved performance comes at the cost of growing computational …

Pvnas: 3d neural architecture search with point-voxel convolution

Z Liu, H Tang, S Zhao, K Shao… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
3D neural networks are widely used in real-world applications (eg, AR/VR headsets, self-
driving cars). They are required to be fast and accurate; however, limited hardware …

Finding the task-optimal low-bit sub-distribution in deep neural networks

R Dong, Z Tan, M Wu, L Zhang… - … Conference on Machine …, 2022 - proceedings.mlr.press
Quantized neural networks typically require smaller memory footprints and lower
computation complexity, which is crucial for efficient deployment. However, quantization …

Alps: Adaptive quantization of deep neural networks with generalized posits

HF Langroudi, V Karia, Z Carmichael… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this paper, a new adaptive quantization algorithm for generalized posit format is
presented, to optimally represent the dynamic range and distribution of deep neural network …

Design automation for fast, lightweight, and effective deep learning models: A survey

D Zhang, K Chen, Y Zhao, B Yang, L Yao… - arxiv preprint arxiv …, 2022 - arxiv.org
Deep learning technologies have demonstrated remarkable effectiveness in a wide range of
tasks, and deep learning holds the potential to advance a multitude of applications …

FxP-QNet: a post-training quantizer for the design of mixed low-precision DNNs with dynamic fixed-point representation

A Shawahna, SM Sait, A El-Maleh, I Ahmad - IEEE Access, 2022 - ieeexplore.ieee.org
Deep neural networks (DNNs) have demonstrated their effectiveness in a wide range of
computer vision tasks, with the state-of-the-art results obtained through complex and deep …

SANA: Sensitivity-Aware Neural Architecture Adaptation for Uniform Quantization

M Guo, Z Dong, K Keutzer - Applied Sciences, 2023 - mdpi.com
Uniform quantization is widely taken as an efficient compression method in practical
applications. Despite its merit of having a low computational overhead, uniform quantization …

Accelerable lottery tickets with the mixed-precision quantization

Z Li, Y Gong, Z Zhang, X Xue, T Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
In recent years, the lottery tickets hypothesis has gained widespread popularity as a means
of network compression. However, the practical application of lottery tickets for hardware …