Structured pruning for deep convolutional neural networks: A survey
The remarkable performance of deep Convolutional neural networks (CNNs) is generally
attributed to their deeper and wider architectures, which can come with significant …
attributed to their deeper and wider architectures, which can come with significant …
Transforming large-size to lightweight deep neural networks for IoT applications
Deep Neural Networks (DNNs) have gained unprecedented popularity due to their high-
order performance and automated feature extraction capability. This has encouraged …
order performance and automated feature extraction capability. This has encouraged …
Spvit: Enabling faster vision transformers via latency-aware soft token pruning
Abstract Recently, Vision Transformer (ViT) has continuously established new milestones in
the computer vision field, while the high computation and memory cost makes its …
the computer vision field, while the high computation and memory cost makes its …
Mest: Accurate and fast memory-economic sparse training framework on the edge
Recently, a new trend of exploring sparsity for accelerating neural network training has
emerged, embracing the paradigm of training on the edge. This paper proposes a novel …
emerged, embracing the paradigm of training on the edge. This paper proposes a novel …
Chex: Channel exploration for cnn model compression
Channel pruning has been broadly recognized as an effective technique to reduce the
computation and memory cost of deep convolutional neural networks. However …
computation and memory cost of deep convolutional neural networks. However …
Forms: Fine-grained polarized reram-based in-situ computation for mixed-signal dnn accelerator
Recent work demonstrated the promise of using resistive random access memory (ReRAM)
as an emerging technology to perform inherently parallel analog domain in-situ matrix …
as an emerging technology to perform inherently parallel analog domain in-situ matrix …
Accelerating federated learning for iot in big data analytics with pruning, quantization and selective updating
The ever-increasing number of Internet of Things (IoT) devices are continuously generating
huge masses of data, but the current cloud-centric approach for IoT big data analysis has …
huge masses of data, but the current cloud-centric approach for IoT big data analysis has …
Hardware acceleration of sparse and irregular tensor computations of ml models: A survey and insights
Machine learning (ML) models are widely used in many important domains. For efficiently
processing these computational-and memory-intensive applications, tensors of these …
processing these computational-and memory-intensive applications, tensors of these …
A comprehensive review of model compression techniques in machine learning
This paper critically examines model compression techniques within the machine learning
(ML) domain, emphasizing their role in enhancing model efficiency for deployment in …
(ML) domain, emphasizing their role in enhancing model efficiency for deployment in …
Teachers do more than teach: Compressing image-to-image models
Abstract Generative Adversarial Networks (GANs) have achieved huge success in
generating high-fidelity images, however, they suffer from low efficiency due to tremendous …
generating high-fidelity images, however, they suffer from low efficiency due to tremendous …