Programl: A graph-based program representation for data flow analysis and compiler optimizations

C Cummins, ZV Fisches, T Ben-Nun… - International …, 2021 - proceedings.mlr.press
Machine learning (ML) is increasingly seen as a viable approach for building
compiler optimization heuristics, but many ML methods cannot replicate even the simplest of …

A survey of techniques for dynamic branch prediction

S Mittal - Concurrency and Computation: Practice and …, 2019 - Wiley Online Library
Branch predictor (BP) is an essential component in modern processors since high BP
accuracy can improve performance and reduce energy by decreasing the number of …

Qiskit pulse: programming quantum computers through the cloud with pulses

T Alexander, N Kanazawa, DJ Egger… - Quantum Science …, 2020 - iopscience.iop.org
The quantum circuit model is an abstraction that hides the underlying physical
implementation of gates and measurements on a quantum computer. For precise control of …

Deepbindiff: Learning program-wide code representations for binary diffing

Y Duan, X Li, J Wang, H Yin - 2020 - ink.library.smu.edu.sg
Binary diffing analysis quantitatively measures the differences between two given binaries
and produces fine-grained basic block matching. It has been widely used to enable different …

AUGEM: automatically generate high performance dense linear algebra kernels on x86 CPUs

Q Wang, X Zhang, Y Zhang, Q Yi - Proceedings of the international …, 2013 - dl.acm.org
Basic Linear Algebra Subprograms (BLAS) is a fundamental library in scientific computing. In
this paper, we present a template-based optimization framework, AUGEM, which can …

TRIMMER: application specialization for code debloating

H Sharif, M Abubakar, A Gehani, F Zaffar - Proceedings of the 33rd ACM …, 2018 - dl.acm.org
With the proliferation of new hardware architectures and ever-evolving user requirements,
the software stack is becoming increasingly bloated. In practice, only a limited subset of the …

[BOOK] Heterogeneous computing with OpenCL 2.0

DR Kaeli, P Mistry, D Schaa, DP Zhang - 2015 - books.google.com
Heterogeneous Computing with OpenCL 2.0 teaches OpenCL and parallel programming for
complex systems that may include a variety of device architectures: multi-core CPUs, GPUs …

QED at large: A survey of engineering of formally verified software

T Ringer, K Palmskog, I Sergey… - … and Trends® in …, 2019 - nowpublishers.com
Development of formal proofs of correctness of programs can increase actual and
perceived reliability and facilitate better understanding of program specifications and their …

Finding effective compilation sequences

L Almagor, KD Cooper, A Grosul, TJ Harvey… - ACM SIGPLAN …, 2004 - dl.acm.org
Most modern compilers operate by applying a fixed, program-independent sequence of
optimizations to all programs. Compiler writers choose a single "compilation sequence", or …

Unleashing SmartNIC packet processing performance in P4

J Xing, Y Qiu, KF Hsu, S Sui, K Manaa… - Proceedings of the …, 2023 - dl.acm.org
SmartNICs are on the rise as a packet processing platform, with the trend towards a uniform
P4 programming model. However, unleashing SmartNIC packet processing performance in …