[HTML][HTML] Learning low-precision structured subnetworks using joint layerwise channel pruning and uniform quantization

X Zhang, I Colbert, S Das - Applied Sciences, 2022 - mdpi.com
Pruning and quantization are core techniques used to reduce the inference costs of deep
neural networks. Among the state-of-the-art pruning techniques, magnitude-based pruning …

[KNIHA][B] End-to-End Inference Optimization for Deep Learning-based Image Upsampling Networks

I Colbert - 2023 - search.proquest.com
Many computer vision problems require image upsampling, where the number of pixels per
unit area is increased by inferring values in high-dimensional image space from …

[PDF][PDF] Hardware-Aware Co-Optimization of Deep Convolutional Neural Networks

NK Jha - 2020 - researchgate.net
The unprecedented success of deep neural networks (DNNs), especially convolutional
neural networks (CNNs), stems from its high representational power and capability to model …

Training Deep Neural Networks with Joint Quantization and Pruning of Features and Weights

X Zhang, I Colbert, K Kreutz-Delgado, S Das - openreview.net
Quantization and pruning are widely used to reduce the inference costs of deep neural
networks. In this work, we propose a framework to train deep neural networks using novel …