Hardware acceleration of LLMs: A comprehensive survey and comparison

N Koilia, C Kachris - ar** of Heterogeneous DNN Models on Adaptive Multi-Accelerator Systems
J Zhao, G Shen, W Ding, Q Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
As DNNs are develo** rapidly, the computational and memory burden imposed on
hardware systems grows exponentially. This becomes even more severe for large language …