Dynamic channel pruning: Feature boosting and suppression

X Gao, Y Zhao, Ł Dudziak, R Mullins, C Xu - arxiv preprint arxiv …, 2018 - arxiv.org
Making deep convolutional neural networks more accurate typically comes at the cost of
increased computational and memory resources. In this paper, we reduce this cost by …

Automatic generation of multi-precision multi-arithmetic CNN accelerators for FPGAs

Y Zhao, X Gao, X Guo, J Liu, E Wang… - … Conference on Field …, 2019 - ieeexplore.ieee.org
Modern deep Convolutional Neural Networks (CNNs) are computationally demanding, yet
real applications often require high throughput and low latency. To help tackle these …

Storage efficient and dynamic flexible runtime channel pruning via deep reinforcement learning

J Chen, S Chen, SJ Pan - Advances in neural information …, 2020 - proceedings.neurips.cc
In this paper, we propose a deep reinforcement learning (DRL) based framework to
efficiently perform runtime channel pruning on convolutional neural networks (CNNs). Our …

Focused quantization for sparse CNNs

Y Zhao, X Gao, D Bates, R Mullins… - Advances in Neural …, 2019 - proceedings.neurips.cc
Deep convolutional neural networks (CNNs) are powerful tools for a wide range of vision
tasks, but the enormous amount of memory and compute resources required by CNNs …

Self-adaptive network pruning

J Chen, Z Zhu, C Li, Y Zhao - … 2019, Sydney, NSW, Australia, December 12 …, 2019 - Springer
Deep convolutional neural networks have been proved successful on a wide range of tasks,
yet they are still hindered by their large computation cost in many industrial scenarios. In this …

Characterizing sources of ineffectual computations in deep learning networks

M Nikolić, M Mahmoud, A Moshovos… - … Analysis of Systems …, 2019 - ieeexplore.ieee.org
Hardware accelerators for inference with neural networks can take advantage of the
properties of data they process. Performance gains and reduced memory bandwidth during …

CNN-based object detection on low precision hardware: Racing car case study

N De Rita, A Aimar, T Delbruck - 2019 IEEE Intelligent Vehicles …, 2019 - ieeexplore.ieee.org
Increasing interest in deep learning and convolutional neural networks resulted in the last
years in multiple techniques aiming to improve their accuracy, training speed, and inference …

Lc: A flexible, extensible open-source toolkit for model compression

Y Idelbayev, MÁ Carreira-Perpiñán - Proceedings of the 30th ACM …, 2021 - dl.acm.org
The continued increase in memory, runtime and energy consumption of deployed machine
learning models on one side, and the trend to miniaturize intelligent devices and sensors on …

Now that I can see, I can improve: Enabling data-driven finetuning of CNNs on the edge

A Rajagopal, CS Bouganis - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com
In today's world, a vast amount of data is being generated by edge devices that can be used
as valuable training data to improve the performance of machine learning algorithms in …

End-to-end deep reinforcement learning in computer systems

M Schaarschmidt - 2020 - repository.cam.ac.uk
The growing complexity of data processing systems has long led systems designers to
imagine systems (eg databases, schedulers) which can self-configure and adapt based on …