A survey on compiler autotuning using machine learning
Since the mid-1990s, researchers have been trying to use machine-learning-based
approaches to solve a number of different compiler optimization problems. These …
approaches to solve a number of different compiler optimization problems. These …
Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology
Modern microprocessors can achieve high performance on linear algebra kernels but this
currently requires extensive machinoapeci6c hand tuning. We have developed a …
currently requires extensive machinoapeci6c hand tuning. We have developed a …
Petabricks: A language and compiler for algorithmic choice
It is often impossible to obtain a one-size-fits-all solution for high performance algorithms
when considering different choices for data distributions, parallelism, transformations, and …
when considering different choices for data distributions, parallelism, transformations, and …
Rigorous floating-point mixed-precision tuning
Virtually all real-valued computations are carried out using floating-point data types and
operations. The precision of these data types must be set with the goals of reducing the …
operations. The precision of these data types must be set with the goals of reducing the …
[BOOK][B] Automatic performance tuning of sparse matrix kernels
RW Vuduc - 2003 - search.proquest.com
This dissertation presents an automated system to generate highly efficient, platform-
adapted implementations of sparse matrix kernels. We show that conventional …
adapted implementations of sparse matrix kernels. We show that conventional …
Rapidly selecting good compiler optimizations using performance counters
Applying the right compiler optimizations to a particular program can have a significant
impact on program performance. Due to the non-linear interaction of compiler optimizations …
impact on program performance. Due to the non-linear interaction of compiler optimizations …
Predictive modeling in a polyhedral optimization space
High-level program optimizations, such as loop transformations, are critical for high
performance on multi-core targets. However, complex sequences of loop transformations …
performance on multi-core targets. However, complex sequences of loop transformations …
Fast multi-parameter performance modeling
Tuning large applications requires a clever exploration of the design and configuration
space. Especially on supercomputers, this space is so large that its exhaustive traversal via …
space. Especially on supercomputers, this space is so large that its exhaustive traversal via …
Sound mixed-precision optimization with rewriting
Finite-precision arithmetic, widely used in embedded systems for numerical calculations,
faces an inherent tradeoff between accuracy and efficiency. The points in this tradeoff space …
faces an inherent tradeoff between accuracy and efficiency. The points in this tradeoff space …
POET: Parameterized optimizations for empirical tuning
The excessive complexity of both machine architectures and applications have made it
difficult for compilers to statically model and predict application behavior. This observation …
difficult for compilers to statically model and predict application behavior. This observation …