- Academic Search

C Cummins, V Seeker, D Grubisic, M Elhoushi… - arxiv preprint arxiv …, 2023 - arxiv.org

We explore the novel application of Large Language Models to code optimization. We
present a 7B-parameter transformer model trained from scratch to optimize LLVM assembly …

Enregistrer Citer Cité 43 fois Autres articles Les 3 versions Free GPT-4 DeepSeek Version HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Tpugraphs: A performance prediction dataset on large tensor computational graphs

M Phothilimthana, S Abu-El-Haija… - Advances in …, 2023 - proceedings.neurips.cc

Precise hardware performance models play a crucial role in code optimizations. They can
assist compilers in making heuristic decisions or aid autotuners in identifying the optimal …

Enregistrer Citer Cité 14 fois Autres articles Les 7 versions Free GPT-4 DeepSeek Version HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Meta large language model compiler: Foundation models of compiler optimization

C Cummins, V Seeker, D Grubisic, B Roziere… - arxiv preprint arxiv …, 2024 - arxiv.org

Large Language Models (LLMs) have demonstrated remarkable capabilities across a
variety of software engineering and coding tasks. However, their application in the domain of …

Enregistrer Citer Cité 16 fois Autres articles Les 4 versions Free GPT-4 DeepSeek Version HTML

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Telamalloc: Efficient on-chip memory allocation for production machine learning accelerators

M Maas, U Beaugnon, A Chauhan, B Ilbeyi - Proceedings of the 28th …, 2022 - dl.acm.org

Memory buffer allocation for on-chip memories is a major challenge in modern machine
learning systems that target ML accelerators. In interactive systems such as mobile phones …

Enregistrer Citer Cité 22 fois Autres articles Les 2 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Compiler generated feedback for Large Language Models

D Grubisic, C Cummins, V Seeker… - arxiv preprint arxiv …, 2024 - arxiv.org

We introduce a novel paradigm in compiler optimization powered by Large Language
Models with compiler feedback to optimize the code size of LLVM assembly. The model …

Enregistrer Citer Cité 7 fois Autres articles Les 2 versions Free GPT-4 DeepSeek Version HTML

[Free GPT-4]
[DeepSeek]

[PDF] github.io

ALT: Breaking the wall between data layout and loop optimizations for deep learning compilation

Z Xu, J Xu, H Peng, W Wang, X Wang, H Wan… - Proceedings of the …, 2023 - dl.acm.org

Deep learning models rely on highly optimized tensor libraries for efficient inference on
heterogeneous hardware. Current deep compilers typically predetermine layouts of tensors …

Enregistrer Citer Cité 4 fois Autres articles Les 2 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Understanding LLM Embeddings for Regression

E Tang, B Yang, X Song - arxiv preprint arxiv:2411.14708, 2024 - arxiv.org

With the rise of large language models (LLMs) for flexibly processing information as strings,
a natural application is regression, specifically by preprocessing string representations into …

Enregistrer Citer Cité 2 fois Autres articles Les 2 versions Free GPT-4 DeepSeek Version HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Mlgoperf: An ml guided inliner to optimize performance

AH Ashouri, M Elhoushi, Y Hua, X Wang… - arxiv preprint arxiv …, 2022 - arxiv.org

For the past 25 years, we have witnessed an extensive application of Machine Learning to
the Compiler space; the selection and the phase-ordering problem. However, limited works …

Enregistrer Citer Cité 10 fois Autres articles Les 3 versions Free GPT-4 DeepSeek Version HTML

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Neural architecture search using property guided synthesis

C **, PM Phothilimthana, S Roy - Proceedings of the ACM on …, 2022 - dl.acm.org

Neural architecture search (NAS) has become an increasingly important tool within the deep
learning community in recent years, yielding many practical advancements in the design of …

Enregistrer Citer Cité 12 fois Autres articles Les 6 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Saturn: An Optimized Data System for Large Model Deep Learning Workloads

K Nagrecha, A Kumar - arxiv preprint arxiv:2309.01226, 2023 - arxiv.org

Large language models such as GPT-3 & ChatGPT have transformed deep learning (DL),
powering applications that have captured the public's imagination. These models are rapidly …

Enregistrer Citer Cité 1 fois Autres articles Les 2 versions Free GPT-4 DeepSeek Version HTML

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

A flexible approach to autotuning multi-pass machine learning compilers

Large language models for compiler optimization

Tpugraphs: A performance prediction dataset on large tensor computational graphs

Meta large language model compiler: Foundation models of compiler optimization

Telamalloc: Efficient on-chip memory allocation for production machine learning accelerators

Compiler generated feedback for Large Language Models

ALT: Breaking the wall between data layout and loop optimizations for deep learning compilation

Understanding LLM Embeddings for Regression

Mlgoperf: An ml guided inliner to optimize performance

Neural architecture search using property guided synthesis

Saturn: An Optimized Data System for Large Model Deep Learning Workloads