Compiler transformations for high-performance computing
DF Bacon, SL Graham, OJ Sharp - ACM Computing Surveys (CSUR), 1994 - dl.acm.org
In the last three decades a large number of compiler transformations for optimizing programs
have been implemented. Most optimizations for uniprocessors reduce the number of …
have been implemented. Most optimizations for uniprocessors reduce the number of …
[PDF][PDF] An Overview of the SUIF Compiler for Scalable Parallel Machines.
We are building a compiler that automatically translates sequential scienti c programs into
parallel code for scalable parallel machines. Many of the compiler techniques needed to …
parallel code for scalable parallel machines. Many of the compiler techniques needed to …
[書籍][B] Parallel computer architecture: a hardware/software approach
The most exciting development in parallel computer architecture is the convergence of
traditionally disparate approaches on a common machine structure. This book explains the …
traditionally disparate approaches on a common machine structure. This book explains the …
SUIF: An infrastructure for research on parallelizing and optimizing compilers
RP Wilson, RS French, CS Wilson… - ACM Sigplan …, 1994 - dl.acm.org
Compiler infrastructures that support experimental research are crucial to the advancement
of high-performance computing. New compiler technology must be implemented and …
of high-performance computing. New compiler technology must be implemented and …
Maximizing multiprocessor performance with the SUIF compiler
This article describes automatic parallelization techniques in the SUIF (Stanford University
Intermediate Format) compiler that result in good multiprocessor performance for array …
Intermediate Format) compiler that result in good multiprocessor performance for array …
An approach for quantitative analysis of application-specific dataflow architectures
B Kienhuis, E Deprettere, K Vissers… - Proceedings IEEE …, 1997 - ieeexplore.ieee.org
In this paper we present an approach for quantitative analysis of application-specific
dataflow architectures. The approach allows the designer to rate design alternatives in a …
dataflow architectures. The approach allows the designer to rate design alternatives in a …
Map** irregular applications to DIVA, a PIM-based data-intensive architecture
Abstract Processing-in-memory (PIM) chips that integrate processor logic into memory
devices offer a new opportunity for bridging the growing gap between processor and …
devices offer a new opportunity for bridging the growing gap between processor and …
Communication optimization and code generation for distributed memory machines
This paper presents several algorithms to solve code generation and optimization problems
specific to machines with distributed address spaces. Given a description of how the …
specific to machines with distributed address spaces. Given a description of how the …
Dimmwitted: A study of main-memory statistical analytics
We perform the first study of the tradeoff space of access methods and replication to support
statistical analytics using first-order methods executed in the main memory of a Non-Uniform …
statistical analytics using first-order methods executed in the main memory of a Non-Uniform …
[PDF][PDF] An affine partitioning algorithm to maximize parallelism and minimize communication
AW Lim, GI Cheong, MS Lam - … of the 13th international conference on …, 1999 - dl.acm.org
An affine partitioning Framework unifies many useful program transforms such as
unimodular transformations(interchange, reversal, skewing), loop fusion, fission, scaling …
unimodular transformations(interchange, reversal, skewing), loop fusion, fission, scaling …