GRANITE: A graph neural network model for basic block throughput estimation
Analytical hardware performance models yield swift estimation of desired hardware
performance metrics. However, develo** these analytical models for modern processors …
performance metrics. However, develo** these analytical models for modern processors …
uiCA: Accurate throughput prediction of basic blocks on recent Intel microarchitectures
Performance models that statically predict the steady-state throughput of basic blocks on
particular microarchitectures, such as IACA, Ithemal, llvm-mca, OSACA, or CQA, can guide …
particular microarchitectures, such as IACA, Ithemal, llvm-mca, OSACA, or CQA, can guide …
Survey on combinatorial register allocation and instruction scheduling
Register allocation (map** variables to processor registers or memory) and instruction
scheduling (reordering instructions to increase instruction-level parallelism) are essential …
scheduling (reordering instructions to increase instruction-level parallelism) are essential …
Combinatorial register allocation and instruction scheduling
This article introduces a combinatorial optimization approach to register allocation and
instruction scheduling, two central compiler problems. Combinatorial optimization has the …
instruction scheduling, two central compiler problems. Combinatorial optimization has the …
Uncovering the performance bottleneck of modern HPC processor with static code analyzer: a case study on Kunpeng 920
The performance of high-performance computing (HPC) and other real-world applications is
becoming unpredictable as the micro-architecture of the modern central processing unit …
becoming unpredictable as the micro-architecture of the modern central processing unit …
Rl4real: Reinforcement learning for register allocation
We aim to automate decades of research and experience in register allocation, leveraging
machine learning. We tackle this problem by embedding a multi-agent reinforcement …
machine learning. We tackle this problem by embedding a multi-agent reinforcement …
Instruction selection
GH Blindell - Principles, Methods, and Applications, 2016 - Springer
Like most doctoral students, I started my studies by reviewing the existing, most prominent
approaches in the field. A couple of months later I thought I had acquired a sufficient …
approaches in the field. A couple of months later I thought I had acquired a sufficient …
Constraint-based diversification of jop gadgets
Modern software deployment process produces software that is uniform, and hence
vulnerable to large-scale code-reuse attacks, such as Jump-Oriented Programming (JOP) …
vulnerable to large-scale code-reuse attacks, such as Jump-Oriented Programming (JOP) …
Trace-based register allocation in a jit compiler
State-of-the-art dynamic compilers often use global approaches, like Linear Scan or Graph
Coloring, for register allocation. These algorithms consider the complete compilation unit for …
Coloring, for register allocation. These algorithms consider the complete compilation unit for …
Evolutionary algorithms for instruction scheduling, operation merging, and register allocation in VLIW compilers
Code generation for VLIW processors includes several optimization problems like code
optimization, instruction scheduling, and register allocation. The high complexity of these …
optimization, instruction scheduling, and register allocation. The high complexity of these …