Compiler transformations for high-performance computing

DF Bacon, SL Graham, OJ Sharp - ACM Computing Surveys (CSUR), 1994 - dl.acm.org
In the last three decades a large number of compiler transformations for optimizing programs
have been implemented. Most optimizations for uniprocessors reduce the number of …

An Overview of the SUIF Compiler for Scalable Parallel Machines.

SP Amarasinghe, JAM Anderson, MS Lam, CW Tseng - PPSC, 1995 - Citeseer
We are building a compiler that automatically translates sequential scientific programs into
parallel code for scalable parallel machines. Many of the compiler techniques needed to …

[Book] Parallel computer architecture: a hardware/software approach

D Culler, JP Singh, A Gupta - 1999 - books.google.com
The most exciting development in parallel computer architecture is the convergence of
traditionally disparate approaches on a common machine structure. This book explains the …

SUIF: An infrastructure for research on parallelizing and optimizing compilers

RP Wilson, RS French, CS Wilson… - ACM Sigplan …, 1994 - dl.acm.org
Compiler infrastructures that support experimental research are crucial to the advancement
of high-performance computing. New compiler technology must be implemented and …

Maximizing multiprocessor performance with the SUIF compiler

MW Hall, JM Anderson, SP Amarasinghe… - Computer, 1996 - ieeexplore.ieee.org
This article describes automatic parallelization techniques in the SUIF (Stanford University
Intermediate Format) compiler that result in good multiprocessor performance for array …

An approach for quantitative analysis of application-specific dataflow architectures

B Kienhuis, E Deprettere, K Vissers… - Proceedings IEEE …, 1997 - ieeexplore.ieee.org
In this paper we present an approach for quantitative analysis of application-specific
dataflow architectures. The approach allows the designer to rate design alternatives in a …

Mapping irregular applications to DIVA, a PIM-based data-intensive architecture

M Hall, P Kogge, J Koller, P Diniz, J Chame… - Proceedings of the …, 1999 - dl.acm.org
Processing-in-memory (PIM) chips that integrate processor logic into memory
devices offer a new opportunity for bridging the growing gap between processor and …

Communication optimization and code generation for distributed memory machines

SP Amarasinghe, MS Lam - Proceedings of the ACM SIGPLAN 1993 …, 1993 - dl.acm.org
This paper presents several algorithms to solve code generation and optimization problems
specific to machines with distributed address spaces. Given a description of how the …

DimmWitted: A study of main-memory statistical analytics

C Zhang, C Ré - arXiv preprint arXiv:1403.7550, 2014 - arxiv.org
We perform the first study of the tradeoff space of access methods and replication to support
statistical analytics using first-order methods executed in the main memory of a Non-Uniform …

An affine partitioning algorithm to maximize parallelism and minimize communication

AW Lim, GI Cheong, MS Lam - … of the 13th international conference on …, 1999 - dl.acm.org
An affine partitioning framework unifies many useful program transforms such as
unimodular transformations (interchange, reversal, skewing), loop fusion, fission, scaling …