A survey on compiler autotuning using machine learning
Since the mid-1990s, researchers have been trying to use machine-learning-based
approaches to solve a number of different compiler optimization problems. These …
approaches to solve a number of different compiler optimization problems. These …
Dnnfusion: accelerating deep neural networks execution with advanced operator fusion
Deep Neural Networks (DNNs) have emerged as the core enabler of many major
applications on mobile devices. To achieve high accuracy, DNN models have become …
applications on mobile devices. To achieve high accuracy, DNN models have become …
Machine learning in compiler optimization
In the last decade, machine-learning-based compilation has moved from an obscure
research niche to a mainstream activity. In this paper, we describe the relationship between …
research niche to a mainstream activity. In this paper, we describe the relationship between …
A practical automatic polyhedral parallelizer and locality optimizer
We present the design and implementation of an automatic polyhedral source-to-source
transformation framework that can optimize regular programs (sequences of possibly …
transformation framework that can optimize regular programs (sequences of possibly …
Qilin: exploiting parallelism on heterogeneous multiprocessors with adaptive map**
Heterogeneous multiprocessors are increasingly important in the multi-core era due to their
potential for high performance and energy efficiency. In order for software to fully realize this …
potential for high performance and energy efficiency. In order for software to fully realize this …
The polyhedral model is more widely applicable than you think
The polyhedral model is a powerful framework for automatic optimization and parallelization.
It is based on an algebraic representation of programs, allowing to construct and search for …
It is based on an algebraic representation of programs, allowing to construct and search for …
PolySA: Polyhedral-based systolic array auto-compilation
J Cong, J Wang - 2018 IEEE/ACM International Conference on …, 2018 - ieeexplore.ieee.org
Automatic systolic array generation has long been an interesting topic due to the need to
reduce the lengthy development cycles of manual designs. Existing automatic systolic array …
reduce the lengthy development cycles of manual designs. Existing automatic systolic array …
Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model
The polyhedral model provides powerful abstractions to optimize loop nests with regular
accesses. Affine transformations in this model capture a complex sequence of execution …
accesses. Affine transformations in this model capture a complex sequence of execution …
[PDF][PDF] Pluto: A practical and fully automatic polyhedral program optimization system
We present the design and implementation of a fully automatic polyhedral source-to-source
transformation framework that can optimize regular programs (sequences of possibly …
transformation framework that can optimize regular programs (sequences of possibly …
Towards a holistic approach to auto-parallelization: integrating profile-driven parallelism detection and machine-learning based map**
Compiler-based auto-parallelization is a much studied area, yet has still not found wide-
spread application. This is largely due to the poor exploitation of application parallelism …
spread application. This is largely due to the poor exploitation of application parallelism …