Searching efficient 3d architectures with sparse point-voxel convolution
Self-driving cars need to understand 3D scenes efficiently and accurately in order to drive
safely. Given the limited hardware resources, existing 3D perception models are not able to …
safely. Given the limited hardware resources, existing 3D perception models are not able to …
Enable deep learning on mobile devices: Methods, systems, and applications
Deep neural networks (DNNs) have achieved unprecedented success in the field of artificial
intelligence (AI), including computer vision, natural language processing, and speech …
intelligence (AI), including computer vision, natural language processing, and speech …
Sparsevit: Revisiting activation sparsity for efficient high-resolution vision transformer
High-resolution images enable neural networks to learn richer visual representations.
However, this improved performance comes at the cost of growing computational …
However, this improved performance comes at the cost of growing computational …
Pvnas: 3d neural architecture search with point-voxel convolution
3D neural networks are widely used in real-world applications (eg, AR/VR headsets, self-
driving cars). They are required to be fast and accurate; however, limited hardware …
driving cars). They are required to be fast and accurate; however, limited hardware …
Finding the task-optimal low-bit sub-distribution in deep neural networks
Quantized neural networks typically require smaller memory footprints and lower
computation complexity, which is crucial for efficient deployment. However, quantization …
computation complexity, which is crucial for efficient deployment. However, quantization …
Alps: Adaptive quantization of deep neural networks with generalized posits
In this paper, a new adaptive quantization algorithm for generalized posit format is
presented, to optimally represent the dynamic range and distribution of deep neural network …
presented, to optimally represent the dynamic range and distribution of deep neural network …
Design automation for fast, lightweight, and effective deep learning models: A survey
Deep learning technologies have demonstrated remarkable effectiveness in a wide range of
tasks, and deep learning holds the potential to advance a multitude of applications …
tasks, and deep learning holds the potential to advance a multitude of applications …
FxP-QNet: a post-training quantizer for the design of mixed low-precision DNNs with dynamic fixed-point representation
Deep neural networks (DNNs) have demonstrated their effectiveness in a wide range of
computer vision tasks, with the state-of-the-art results obtained through complex and deep …
computer vision tasks, with the state-of-the-art results obtained through complex and deep …
SANA: Sensitivity-Aware Neural Architecture Adaptation for Uniform Quantization
Uniform quantization is widely taken as an efficient compression method in practical
applications. Despite its merit of having a low computational overhead, uniform quantization …
applications. Despite its merit of having a low computational overhead, uniform quantization …
Accelerable lottery tickets with the mixed-precision quantization
In recent years, the lottery tickets hypothesis has gained widespread popularity as a means
of network compression. However, the practical application of lottery tickets for hardware …
of network compression. However, the practical application of lottery tickets for hardware …