A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning

J Ma, X Li, Z Wang, X Zhang, S Yan, Y Chen… - Proceedings of the 61st …, 2024 - dl.acm.org
As deep learning empowers various fields, many domain-specific non-neural network
operators have been proposed to improve the accuracy of deep learning models …

ITIF: Integrated Transformers Inference Framework for Multiple Tenants on GPU

Y Zhang, Z Zhang, W Bao, D Yuan - Proceedings of the 52nd …, 2023 - dl.acm.org
Transformer models, which have gained prominence in recent years, serve as the backbone
for a wide range of deep learning applications, from natural language processing to …