A deep learning based cost model for automatic code optimization
R Baghdadi, M Merouani… - Proceedings of …, 2021 - proceedings.mlsys.org
Enabling compilers to automatically optimize code has been a longstanding goal for the
compiler community. Efficiently solving this problem requires using precise cost models …
PENCIL: A platform-neutral compute intermediate language for accelerator programming
Programming accelerators such as GPUs with low-level APIs and languages such as
OpenCL and CUDA is difficult, error-prone, and not performance-portable. Automatic …
A compiler for throughput optimization of graph algorithms on GPUs
Writing high-performance GPU implementations of graph algorithms can be challenging. In
this paper, we argue that three optimizations called throughput optimizations are key to high …
LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers
M Merouani, KA Boudaoud, IN Aouadj… - arXiv preprint arXiv …, 2024 - arxiv.org
While polyhedral compilers have shown success in implementing advanced code
transformations, they still have challenges in selecting the most profitable transformations …
Polyhedral optimizations of explicitly parallel programs
The polyhedral model is a powerful algebraic framework that has enabled significant
advances to analysis and transformation of sequential affine (sub) programs, relative to …
Verification of loop parallelisations
Writing correct parallel programs becomes more and more difficult as the complexity and
heterogeneity of processors increase. This issue is addressed by parallelising compilers …
VOBLA: A vehicle for optimized basic linear algebra
U Beaugnon, A Kravets, S Van Haastregt… - Proceedings of the …, 2014 - dl.acm.org
We present VOBLA, a domain-specific language designed for programming linear algebra
libraries. VOBLA is compiled to PENCIL, a domain independent intermediate language …
Algorithmic species revisited: A program code classification based on array references
C Nugteren, R Corvino… - 2013 IEEE 6th …, 2013 - ieeexplore.ieee.org
The shift towards parallel processor architectures has made programming, performance
prediction and code generation increasingly challenging. Abstract representations of …
Program correctness by transformation
Deductive program verification can be used effectively to verify high-level programs, but can
be challenging for low-level, high-performance code. In this paper, we argue that …
[PDF] Polyhedral transformations of explicitly parallel programs
Prasanth Chatarasi, Jun Shirako, Vivek Sarkar …