I-vit: Integer-only quantization for efficient vision transformer inference Z Li, Q Gu International Conference on Computer Vision (ICCV), 2023, 2023 | 105 | 2023 |
Repq-vit: Scale reparameterization for post-training quantization of vision transformers Z Li, J Xiao, L Yang, Q Gu International Conference on Computer Vision (ICCV), 2023, 2023 | 84 | 2023 |
Llm inference unveiled: Survey and roofline model insights Z Yuan, Y Shang, Y Zhou, Z Dong, Z Zhou, C Xue, B Wu, Z Li, Q Gu, ... arXiv preprint arXiv:2402.16363, 2024 | 63 | 2024 |
Patch similarity aware data-free quantization for vision transformers Z Li, L Ma, M Chen, J Xiao, Q Gu European Conference on Computer Vision (ECCV), 2022, 2022 | 56 | 2022 |
PSAQ-ViT V2: Toward Accurate and General Data-Free Quantization for Vision Transformers Z Li, M Chen, J Xiao, Q Gu IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024, 2024 | 38 | 2024 |
Hardware-oriented algorithm for high-speed laser centerline extraction based on Hessian matrix Z Li, L Ma, X Long, Y Chen, H Deng, F Yan, Q Gu IEEE Transactions on Instrumentation and Measurement (TIM), 2021, 2021 | 27 | 2021 |
Enhanced distribution alignment for post-training quantization of diffusion models X Liu, Z Li, J Xiao, Q Gu arXiv e-prints, arXiv: 2401.04585, 2024 | 12 | 2024 |
Rethinking prediction alignment in one-stage object detection J Xiao, H Jiang, Z Li, Q Gu Neurocomputing, 2022, 2022 | 11 | 2022 |
Patch-wise Mixed-Precision Quantization of Vision Transformer J Xiao, Z Li, L Yang, Q Gu International Joint Conference on Neural Networks (IJCNN), 2023, 2023 | 9 | 2023 |
Dual-discriminator adversarial framework for data-free quantization Z Li, L Ma, X Long, J Xiao, Q Gu Neurocomputing, 2022, 2022 | 8 | 2022 |
DCIFPN: Deformable cross‐scale interaction feature pyramid network for object detection J Xiao, H Jiang, Z Li, Q Gu IET Image Processing, 2023, 2023 | 7 | 2023 |
Qft: Quantized full-parameter tuning of llms with affordable resources Z Li, X Liu, B Zhu, Z Dong, Q Gu, K Keutzer arXiv preprint arXiv:2310.07147, 2023 | 6 | 2023 |
Repquant: Towards accurate post-training quantization of large transformer models via scale reparameterization Z Li, X Liu, J Zhang, Q Gu arXiv preprint arXiv:2402.05628, 2024 | 4 | 2024 |
Region Probability Map-Guided Fast Wide-Area Multiobject Detection X Long, M Chen, Z Li, Q Gu IEEE Transactions on Instrumentation and Measurement (TIM), 2022, 2022 | 4 | 2022 |
Mechanical particle filter-based active vision system for fast wide-area multiobject detection X Long, L Ma, H Jiang, Z Li, Y Chen, Q Gu IEEE Transactions on Instrumentation and Measurement (TIM), 2022, 2022 | 4 | 2022 |
Binaryvit: Towards efficient and accurate binary vision transformers J Xiao, Z Li, L Yang, Q Gu IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024, 2024 | 3 | 2024 |
K-sort arena: Efficient and reliable benchmarking for generative models via k-wise human preferences Z Li, X Liu, D Fu, J Li, Q Gu, K Keutzer, Z Dong arXiv preprint arXiv:2408.14468, 2024 | 2 | 2024 |
MGRQ: Post-Training Quantization For Vision Transformer With Mixed Granularity Reconstruction L Yang, Z Li, J Xiao, H Gong, Q Gu International Conference on Image Processing (ICIP), 2024, 2024 | 2 | 2024 |
HTQ: Exploring the High-Dimensional Trade-Off of mixed-precision quantization Z Li, X Long, J Xiao, Q Gu Pattern Recognition (PR), 2024, 2024 | 1 | 2024 |
TTAQ: Towards Stable Post-training Quantization in Continuous Domain Adaptation J Xiao, Z Li, L Yang, Y Mei, Q Gu arXiv preprint arXiv:2412.09899, 2024 | | 2024 |