A practical automatic polyhedral parallelizer and locality optimizer

U Bondhugula, A Hartono, J Ramanujam… - Proceedings of the 29th …, 2008 - dl.acm.org
We present the design and implementation of an automatic polyhedral source-to-source
transformation framework that can optimize regular programs (sequences of possibly …

Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model

U Bondhugula, M Baskaran, S Krishnamoorthy… - … CC 2008, Held as Part of …, 2008 - Springer
The polyhedral model provides powerful abstractions to optimize loop nests with regular
accesses. Affine transformations in this model capture a complex sequence of execution …

[PDF][PDF] Pluto: A practical and fully automatic polyhedral program optimization system

U Bondhugula, A Hartono, J Ramanujam… - Proceedings of the …, 2008 - researchgate.net
We present the design and implementation of a fully automatic polyhedral source-to-source
transformation framework that can optimize regular programs (sequences of possibly …

[PDF][PDF] An affine partitioning algorithm to maximize parallelism and minimize communication

AW Lim, GI Cheong, MS Lam - … of the 13th international conference on …, 1999 - dl.acm.org
An affine partitioning framework unifies many useful program transforms such as
unimodular transformations (interchange, reversal, skewing), loop fusion, fission, scaling …

Loop transformations: convexity, pruning and optimization

LN Pouchet, U Bondhugula, C Bastoul, A Cohen… - ACM SIGPLAN …, 2011 - dl.acm.org
High-level loop transformations are a key instrument in mapping computational kernels to
effectively exploit the resources in modern processor architectures. Nevertheless, selecting …

[BOOK][B] Scheduling and automatic parallelization

A Darte, Y Robert, F Vivien - 2012 - books.google.com
Page 1 Alain Darte Yves Robert Frédéric Vivien Scheduling and Automatic Parallelization Springer
Science Business Media, LLC Page 2 Scheduling and Automatic Parallelization Page 3 Alain …

[HTML][HTML] Maximizing parallelism and minimizing synchronization with affine partitions

AW Lim, MS Lam - Parallel computing, 1998 - Elsevier
This paper presents an algorithm to find the optimal affine partitions that maximize the
degree of parallelism and minimize the degree of synchronization in programs with arbitrary …

A framework for enhancing data reuse via associative reordering

K Stock, M Kong, T Grosser, LN Pouchet… - Proceedings of the 35th …, 2014 - dl.acm.org
The freedom to reorder computations involving associative operators has been widely
recognized and exploited in designing parallel algorithms and to a more limited extent in …

Loop parallelization algorithms: From parallelism extraction to code generation

P Boulet, A Darte, GA Silber, F Vivien - Parallel Computing, 1998 - Elsevier
In this paper, we survey loop parallelization algorithms, analyzing the dependence
representations they use, the loop transformations they generate, the code generation …

Combined iterative and model-driven optimization in an automatic parallelization framework

LN Pouchet, U Bondhugula, C Bastoul… - SC'10: Proceedings …, 2010 - ieeexplore.ieee.org
Today's multi-core era places significant demands on an optimizing compiler, which must
parallelize programs, exploit memory hierarchy, and leverage the ever-increasing SIMD …