A survey on efficient convolutional neural networks and hardware acceleration
Over the past decade, deep-learning-based representations have demonstrated remarkable
performance in academia and industry. The learning capability of convolutional neural …
performance in academia and industry. The learning capability of convolutional neural …
Structured pruning for deep convolutional neural networks: A survey
The remarkable performance of deep Convolutional neural networks (CNNs) is generally
attributed to their deeper and wider architectures, which can come with significant …
attributed to their deeper and wider architectures, which can come with significant …
A survey of quantization methods for efficient neural network inference
This chapter provides approaches to the problem of quantizing the numerical values in deep
Neural Network computations, covering the advantages/disadvantages of current methods …
Neural Network computations, covering the advantages/disadvantages of current methods …
AutoML: A survey of the state-of-the-art
Deep learning (DL) techniques have obtained remarkable achievements on various tasks,
such as image recognition, object detection, and language modeling. However, building a …
such as image recognition, object detection, and language modeling. However, building a …
Searching efficient 3d architectures with sparse point-voxel convolution
Self-driving cars need to understand 3D scenes efficiently and accurately in order to drive
safely. Given the limited hardware resources, existing 3D perception models are not able to …
safely. Given the limited hardware resources, existing 3D perception models are not able to …
Efficientvit: Lightweight multi-scale attention for high-resolution dense prediction
High-resolution dense prediction enables many appealing real-world applications, such as
computational photography, autonomous driving, etc. However, the vast computational cost …
computational photography, autonomous driving, etc. However, the vast computational cost …
Spatten: Efficient sparse attention architecture with cascade token and head pruning
The attention mechanism is becoming increasingly popular in Natural Language Processing
(NLP) applications, showing superior performance than convolutional and recurrent …
(NLP) applications, showing superior performance than convolutional and recurrent …
Quantumnas: Noise-adaptive search for robust quantum circuits
Quantum noise is the key challenge in Noisy Intermediate-Scale Quantum (NISQ)
computers. Previous work for mitigating noise has primarily focused on gate-level or pulse …
computers. Previous work for mitigating noise has primarily focused on gate-level or pulse …
Hardware and software optimizations for accelerating deep neural networks: Survey of current trends, challenges, and the road ahead
Currently, Machine Learning (ML) is becoming ubiquitous in everyday life. Deep Learning
(DL) is already present in many applications ranging from computer vision for medicine to …
(DL) is already present in many applications ranging from computer vision for medicine to …
Enable deep learning on mobile devices: Methods, systems, and applications
Deep neural networks (DNNs) have achieved unprecedented success in the field of artificial
intelligence (AI), including computer vision, natural language processing, and speech …
intelligence (AI), including computer vision, natural language processing, and speech …