Compiler transformations for high-performance computing

DF Bacon, SL Graham, OJ Sharp - ACM Computing Surveys (CSUR), 1994 - dl.acm.org
In the last three decades a large number of compiler transformations for optimizing programs
have been implemented. Most optimizations for uniprocessors reduce the number of …

A survey of adaptive optimization in virtual machines

M Arnold, SJ Fink, D Grove, M Hind… - Proceedings of the …, 2005 - ieeexplore.ieee.org
Virtual machines face significant performance challenges beyond those confronted by
traditional static optimizers. First, portable program representations and dynamic language …

[KSIĄŻKA][B] Computer architecture: a quantitative approach

JL Hennessy, DA Patterson - 2011 - books.google.com
Computer Architecture: A Quantitative Approach, Fifth Edition, explores the ways that
software and technology in the cloud are accessed by digital media, such as cell phones …

Mediabench: A tool for evaluating and synthesizing multimedia and communications systems

C Lee, M Potkonjak… - Proceedings of 30th …, 1997 - ieeexplore.ieee.org
Significant advances have been made in compilation technology for capitalizing on
instruction-level parallelism (ILP). The vast majority of ILP compilation research has been …

[KSIĄŻKA][B] Engineering a compiler

KD Cooper, L Torczon - 2022 - books.google.com
Engineering a Compiler, Third Edition covers the latest developments in compiler
technology, with new chapters focusing on semantic elaboration (the problems that arise in …

Dynamo: A transparent dynamic optimization system

V Bala, E Duesterwald, S Banerjia - Proceedings of the ACM SIGPLAN …, 2000 - dl.acm.org
We describe the design and implementation of Dynamo, a software dynamic optimization
system that is capable of transparently improving the performance of a native instruction …

Iterative modulo scheduling: An algorithm for software pipelining loops

BR Rau - Proceedings of the 27th annual international …, 1994 - dl.acm.org
Modulo scheduling is a framework within which a wide variety of algorithms and heuristics
may be defined for software pipelining innermost loops. This paper presents a practical …

[PDF][PDF] Effective compiler support for predicated execution using the hyperblock

SA Mahlke, DC Lin, WY Chen, RE Hank… - ACM SIGMICRO …, 1992 - dl.acm.org
Predicated execution is an effective technique for dealing with conditional branches in
application programs. However, there are several problems associated with conventional …

Instruction-level parallel processing: History, overview, and perspective

BR Rau, JA Fisher - The journal of Supercomputing, 1993 - Springer
Instruction-level parallelism (ILP) is a family of processor and compiler design techniques
that speed up execution by causing individual machine operations to execute in parallel …

Space-time scheduling of instruction-level parallelism on a raw machine

W Lee, R Barua, M Frank, D Srikrishna, J Babb… - ACM SIGOPS …, 1998 - dl.acm.org
Increasing demand for both greater parallelism and faster clocks dictate that future
generation architectures will need to decentralize their resources and eliminate primitives …