HTQ: Exploring the High-Dimensional Trade-Off of mixed-precision quantization

Z Li, X Long, J Xiao, Q Gu - Pattern Recognition, 2024 - Elsevier
Mixed-precision quantization, where more sensitive layers are kept at higher precision, can
achieve the trade-off between accuracy and complexity of neural networks. However, the …

GenQ: Quantization in Low Data Regimes with Generative Synthetic Data

Y Li, Y Kim, D Lee, S Kundu, P Panda - European Conference on …, 2024 - Springer
In the realm of deep neural network deployment, low-bit quantization presents a promising
avenue for enhancing computational efficiency. However, it often hinges on the availability …

Low Bit-Width Zero-Shot Quantization With Soft Feature-Infused Hints for IoT Systems

X Chen, Y Wang, Y Li, X Ling, M Li… - IEEE Internet of …, 2024 - ieeexplore.ieee.org
Quantization has enabled the widespread implementation of deep learning algorithms on
resource-constrained Internet of Things (IoT) devices, which compresses neural networks by …

StableQ: Enhancing Data-Scarce Quantization with Text-to-Image Data

Y Li, Y Kim, D Lee, P Panda - arXiv preprint arXiv:2312.05272, 2023 - arxiv.org
Though low-bit quantization enables efficient storage and inference of deep neural
networks, it often requires the use of training data to maintain resilience against quantization …

Semantics Prompting Data-Free Quantization for Low-Bit Vision Transformers

Y Zhong, Y Zhou, Y Zhang, S Li, Y Li, F Chao… - arXiv preprint arXiv …, 2024 - arxiv.org
Data-free quantization (DFQ), which facilitates model quantization without real data to
address increasing concerns about data security, has garnered significant attention within …

Unsupervised Time Series Anomaly Prediction with Importance-based Generative Contrastive Learning

K Zhao, Z Zhuang, C Guo, H Miao, Y Cheng… - arXiv preprint arXiv …, 2024 - arxiv.org
Time series anomaly prediction plays an essential role in many real-world scenarios, such
as environmental prevention and prompt maintenance of cyber-physical systems. However …

Bit-width aware generator and intermediate layer knowledge distillation using channel-wise attention for generative data-free quantization

JY Baek, DH Hur, DW Kim, YS Yoo… - Journal of The Korea …, 2024 - koreascience.kr
In this paper, we propose the BAG (Bit-width Aware Generator) and the Intermediate Layer
Knowledge Distillation using Channel-wise Attention to reduce the knowledge gap between …