Polyhedral parallelization of binary code

B Pradelle, A Ketterlin, P Clauss - ACM Transactions on Architecture and …, 2012 - dl.acm.org
Many automatic software parallelization systems have been proposed in the past decades,
but most of them are dedicated to source-to-source transformations. This paper shows that …

ExanaDBT: A dynamic compilation system for transparent polyhedral optimizations at runtime

Y Sato, T Yuki, T Endo - Proceedings of the Computing Frontiers …, 2017 - dl.acm.org
In this paper, we present a dynamic compilation system called ExanaDBT for transparently
optimizing and parallelizing binaries at runtime based on the polyhedral model. Starting …

Speculative Code Compaction: Eliminating Dead Code via Speculative Microcode Transformations

L Moody, W Qi, A Sharifi, L Berry… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
The computing landscape has been increasingly characterized by processor architectures
with increasing core counts, while a majority of the software applications remain inherently …

Asc: Automatically scalable computation

A Waterland, E Angelino, RP Adams… - ACM SIGARCH …, 2014 - dl.acm.org
We present an architecture designed to transparently and automatically scale the
performance of sequential programs as a function of the hardware resources available. The …

Limits of region-based dynamic binary parallelization

TJK Edler von Koch, B Franke - ACM SIGPLAN Notices, 2013 - dl.acm.org
Efficiently executing sequential legacy binaries on chip multi-processors (CMPs) composed
of many, small cores is one of today's most pressing problems. Single-threaded execution is …

Computational caches

A Waterland, E Angelino, ED Cubuk, E Kaxiras… - Proceedings of the 6th …, 2013 - dl.acm.org
Caching is a well-known technique for speeding up computation. We cache data from file
systems and databases; we cache dynamically generated code blocks; we cache page …

Automatic parallelization of sequential programs

P Kraft, A Waterland, DY Fu, A Gollamudi… - arxiv preprint arxiv …, 2018 - arxiv.org
Prior work on Automatically Scalable Computation (ASC) suggests that it is possible to
parallelize sequential computation by building a model of whole-program execution, using …

Affine parallelization using dependence and cache analysis in a binary rewriter

A Kotha, K Anand, T Creech… - … on Parallel and …, 2014 - ieeexplore.ieee.org
Today, nearly all general-purpose computers are parallel, but nearly all software running on
them is serial. Bridging this disconnect by manually rewriting source code in parallel is …

Runtime on-stack parallelization of dependence-free for-loops in binary programs

M Yusuf, A El-Mahdy, E Rohou - IEEE Letters of the Computer …, 2019 - ieeexplore.ieee.org
With the multicore trend, the need for automatic parallelization is more pronounced,
especially for legacy and proprietary code where no source code is available and/or the …

Robust Low-Overhead Binary Rewriting: Design, Extensibility, and Customizability

AM Kupaei - 2021 - search.proquest.com
Binary rewriting is the foundation of a wide range of binary analysis tools and techniques,
including securing untrusted code, enforcing control-flow integrity, dynamic optimization …