A survey of techniques for optimizing transformer inference

KT Chitty-Venkata, S Mittal, M Emani… - Journal of Systems …, 2023 - Elsevier
Recent years have seen a phenomenal rise in the performance and applications of
transformer neural networks. The family of transformer networks, including Bidirectional …

Performance enhancement of artificial intelligence: A survey

M Krichen, MS Abdalzaher - Journal of Network and Computer Applications, 2024 - Elsevier
The advent of machine learning (ML) and Artificial intelligence (AI) has brought about a
significant transformation across multiple industries, as it has facilitated the automation of …

Revisiting the parameter efficiency of adapters from the perspective of precision redundancy

S Jie, H Wang, ZH Deng - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Current state-of-the-art results in computer vision depend in part on fine-tuning large pre-
trained vision models. However, with the exponential growth of model sizes, the …