Large language models for compiler optimization

C Cummins, V Seeker, D Grubisic, M Elhoushi… - arxiv preprint arxiv …, 2023 - arxiv.org
We explore the novel application of Large Language Models to code optimization. We
present a 7B-parameter transformer model trained from scratch to optimize LLVM assembly …

Tpugraphs: A performance prediction dataset on large tensor computational graphs

M Phothilimthana, S Abu-El-Haija… - Advances in …, 2023 - proceedings.neurips.cc
Precise hardware performance models play a crucial role in code optimizations. They can
assist compilers in making heuristic decisions or aid autotuners in identifying the optimal …

Meta large language model compiler: Foundation models of compiler optimization

C Cummins, V Seeker, D Grubisic, B Roziere… - arxiv preprint arxiv …, 2024 - arxiv.org
Large Language Models (LLMs) have demonstrated remarkable capabilities across a
variety of software engineering and coding tasks. However, their application in the domain of …

Telamalloc: Efficient on-chip memory allocation for production machine learning accelerators

M Maas, U Beaugnon, A Chauhan, B Ilbeyi - Proceedings of the 28th …, 2022 - dl.acm.org
Memory buffer allocation for on-chip memories is a major challenge in modern machine
learning systems that target ML accelerators. In interactive systems such as mobile phones …

Compiler generated feedback for Large Language Models

D Grubisic, C Cummins, V Seeker… - arxiv preprint arxiv …, 2024 - arxiv.org
We introduce a novel paradigm in compiler optimization powered by Large Language
Models with compiler feedback to optimize the code size of LLVM assembly. The model …

ALT: Breaking the wall between data layout and loop optimizations for deep learning compilation

Z Xu, J Xu, H Peng, W Wang, X Wang, H Wan… - Proceedings of the …, 2023 - dl.acm.org
Deep learning models rely on highly optimized tensor libraries for efficient inference on
heterogeneous hardware. Current deep compilers typically predetermine layouts of tensors …

Understanding LLM Embeddings for Regression

E Tang, B Yang, X Song - arxiv preprint arxiv:2411.14708, 2024 - arxiv.org
With the rise of large language models (LLMs) for flexibly processing information as strings,
a natural application is regression, specifically by preprocessing string representations into …

Mlgoperf: An ml guided inliner to optimize performance

AH Ashouri, M Elhoushi, Y Hua, X Wang… - arxiv preprint arxiv …, 2022 - arxiv.org
For the past 25 years, we have witnessed an extensive application of Machine Learning to
the Compiler space; the selection and the phase-ordering problem. However, limited works …

Neural architecture search using property guided synthesis

C **, PM Phothilimthana, S Roy - Proceedings of the ACM on …, 2022 - dl.acm.org
Neural architecture search (NAS) has become an increasingly important tool within the deep
learning community in recent years, yielding many practical advancements in the design of …

Saturn: An Optimized Data System for Large Model Deep Learning Workloads

K Nagrecha, A Kumar - arxiv preprint arxiv:2309.01226, 2023 - arxiv.org
Large language models such as GPT-3 & ChatGPT have transformed deep learning (DL),
powering applications that have captured the public's imagination. These models are rapidly …