FBQuant: FeedBack Quantization for Large Language Models

Y Liu, H Fang, L He, R Zhang, Y Bai, Y Du… - arxiv preprint arxiv …, 2025 - arxiv.org
Deploying Large Language Models (LLMs) on edge devices is increasingly important, as it
eliminates reliance on network connections, reduces expensive API calls, and enhances …