GRANITE: A graph neural network model for basic block throughput estimation

O Sýkora, PM Phothilimthana, C Mendis… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Analytical hardware performance models yield swift estimation of desired hardware
performance metrics. However, develo** these analytical models for modern processors …

uiCA: Accurate throughput prediction of basic blocks on recent Intel microarchitectures

A Abel, J Reineke - Proceedings of the 36th ACM International …, 2022 - dl.acm.org
Performance models that statically predict the steady-state throughput of basic blocks on
particular microarchitectures, such as IACA, Ithemal, llvm-mca, OSACA, or CQA, can guide …

Survey on combinatorial register allocation and instruction scheduling

RC Lozano, C Schulte - ACM Computing Surveys (CSUR), 2019 - dl.acm.org
Register allocation (map** variables to processor registers or memory) and instruction
scheduling (reordering instructions to increase instruction-level parallelism) are essential …

Combinatorial register allocation and instruction scheduling

RC Lozano, M Carlsson, GH Blindell… - ACM Transactions on …, 2019 - dl.acm.org
This article introduces a combinatorial optimization approach to register allocation and
instruction scheduling, two central compiler problems. Combinatorial optimization has the …

Uncovering the performance bottleneck of modern HPC processor with static code analyzer: a case study on Kunpeng 920

S Tan, Q Jiang, Z Cao, X Hao, J Chen, H An - CCF Transactions on High …, 2024 - Springer
The performance of high-performance computing (HPC) and other real-world applications is
becoming unpredictable as the micro-architecture of the modern central processing unit …

Rl4real: Reinforcement learning for register allocation

S VenkataKeerthy, S Jain, A Kundu… - Proceedings of the …, 2023 - dl.acm.org
We aim to automate decades of research and experience in register allocation, leveraging
machine learning. We tackle this problem by embedding a multi-agent reinforcement …

Instruction selection

GH Blindell - Principles, Methods, and Applications, 2016 - Springer
Like most doctoral students, I started my studies by reviewing the existing, most prominent
approaches in the field. A couple of months later I thought I had acquired a sufficient …

Constraint-based diversification of jop gadgets

RM Tsoupidi, RC Lozano, B Baudry - Journal of Artificial Intelligence …, 2021 - jair.org
Modern software deployment process produces software that is uniform, and hence
vulnerable to large-scale code-reuse attacks, such as Jump-Oriented Programming (JOP) …

Trace-based register allocation in a jit compiler

J Eisl, M Grimmer, D Simon, T Würthinger… - Proceedings of the 13th …, 2016 - dl.acm.org
State-of-the-art dynamic compilers often use global approaches, like Linear Scan or Graph
Coloring, for register allocation. These algorithms consider the complete compilation unit for …

Evolutionary algorithms for instruction scheduling, operation merging, and register allocation in VLIW compilers

F Giesemann, L Gerlach, G Paya-Vaya - Journal of Signal Processing …, 2020 - Springer
Code generation for VLIW processors includes several optimization problems like code
optimization, instruction scheduling, and register allocation. The high complexity of these …