HTQ: Exploring the High-Dimensional Trade-Off of mixed-precision quantization
Mixed-precision quantization, where more sensitive layers are kept at higher precision, can
achieve the trade-off between accuracy and complexity of neural networks. However, the …
GenQ: Quantization in Low Data Regimes with Generative Synthetic Data
In the realm of deep neural network deployment, low-bit quantization presents a promising
avenue for enhancing computational efficiency. However, it often hinges on the availability …
Low Bit-Width Zero-Shot Quantization With Soft Feature-Infused Hints for IoT Systems
X Chen, Y Wang, Y Li, X Ling, M Li… - IEEE Internet of …, 2024 - ieeexplore.ieee.org
Quantization has enabled the widespread implementation of deep learning algorithms on
resource-constrained Internet of Things (IoT) devices, which compresses neural networks by …
StableQ: Enhancing Data-Scarce Quantization with Text-to-Image Data
Though low-bit quantization enables efficient storage and inference of deep neural
networks, it often requires the use of training data to maintain resilience against quantization …
Semantics Prompting Data-Free Quantization for Low-Bit Vision Transformers
Data-free quantization (DFQ), which facilitates model quantization without real data to
address increasing concerns about data security, has garnered significant attention within …
Unsupervised Time Series Anomaly Prediction with Importance-based Generative Contrastive Learning
Time series anomaly prediction plays an essential role in many real-world scenarios, such
as environmental prevention and prompt maintenance of cyber-physical systems. However …
Bit-width aware generator and intermediate layer knowledge distillation using channel-wise attention for generative data-free quantization
JY Baek, DH Hur, DW Kim, YS Yoo… - Journal of The Korea …, 2024 - koreascience.kr
In this paper, we propose the BAG (Bit-width Aware Generator) and the Intermediate Layer
Knowledge Distillation using Channel-wise Attention to reduce the knowledge gap between …