Parallel programming with Polaris
Parallel programming tools are limited, making effective parallel programming difficult and
cumbersome. Compilers that translate conventional sequential programs into parallel form …
cumbersome. Compilers that translate conventional sequential programs into parallel form …
The paralax infrastructure: Automatic parallelization with a hel** hand
Speeding up sequential programs on multicores is a challenging problem that is in urgent
need of a solution. Automatic parallelization of irregular pointer-intensive codes, exemplified …
need of a solution. Automatic parallelization of irregular pointer-intensive codes, exemplified …
Hybrid analysis: static & dynamic memory reference analysis
We present a novel Hybrid Analysis technology which can efficiently and seamlessly
integrate all static and run-time analysis of memory references into a single framework that is …
integrate all static and run-time analysis of memory references into a single framework that is …
Compile-time composition of run-time data and iteration reorderings
MM Strout, L Carter, J Ferrante - Proceedings of the ACM SIGPLAN 2003 …, 2003 - dl.acm.org
Many important applications, such as those using sparse data structures, have memory
reference patterns that are unknown at compile-time. Prior work has developed run-time …
reference patterns that are unknown at compile-time. Prior work has developed run-time …
Integrating profile-driven parallelism detection and machine-learning-based map**
Compiler-based auto-parallelization is a much-studied area but has yet to find widespread
application. This is largely due to the poor identification and exploitation of application …
application. This is largely due to the poor identification and exploitation of application …
On the automatic parallelization of the Perfect Benchmarks (R)
R Eigenmann, J Hoeflinger… - IEEE Transactions on …, 1998 - ieeexplore.ieee.org
This paper presents the results of the Cedar Hand-Parallelization Experiment conducted
from 1989 through 1992, within the Center for Supercomputing Research and Development …
from 1989 through 1992, within the Center for Supercomputing Research and Development …
Parallelizing sequential programs with statistical accuracy tests
We present QuickStep, a novel system for parallelizing sequential programs. Unlike
standard parallelizing compilers (which are designed to preserve the semantics of the …
standard parallelizing compilers (which are designed to preserve the semantics of the …
Automating wavefront parallelization for sparse matrix computations
This paper presents a compiler and runtime framework for parallelizing sparse matrix
computations that have loop-carried dependences. Our approach automatically generates a …
computations that have loop-carried dependences. Our approach automatically generates a …
Exposing speculative thread parallelism in SPEC2000
MK Prabhu, K Olukotun - Proceedings of the tenth ACM SIGPLAN …, 2005 - dl.acm.org
As increasing the performance of single-threaded processors becomes increasingly difficult,
consumer desktop processors are moving toward multi-core designs. One way to enhance …
consumer desktop processors are moving toward multi-core designs. One way to enhance …
Runtime composition of iterations for fusing loop-carried sparse dependence
Dependence between iterations in sparse computations causes inefficient use of memory
and computation resources. This paper proposes sparse fusion, a technique that generates …
and computation resources. This paper proposes sparse fusion, a technique that generates …