Divide and conquer frontend bottleneck

A Ansari, P Lotfi-Kamran… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
The frontend stalls caused by instruction and BTB misses are a significant source of
performance degradation in server processors. Prefetchers are commonly employed to …

Q-zilla: A scheduling framework and core microarchitecture for tail-tolerant microservices

A Mirhosseini, BL West, GW Blake… - … Symposium on High …, 2020 - ieeexplore.ieee.org
Managing tail latency is a primary challenge in designing large-scale Internet services.
Queuing is a major contributor to end-to-end tail latency, wherein nominal tasks are …

Resemble: reinforced ensemble framework for data prefetching

P Zhang, R Kannan, A Srivastava… - … Conference for High …, 2022 - ieeexplore.ieee.org
Data prefetching hides memory latency by predicting and loading necessary data into cache
beforehand. Most prefetchers in the literature are efficient for specific memory address …

[PDF][PDF] Multi-lookahead offset prefetching

M Shakerinava… - The Third Data …, 2019 - dpc3.compas.cs.stonybrook.edu
Offset prefetching has been recently proposed as a lowoverhead yet high-performance
approach to eliminate data cache misses or reduce their negative effect. In offset …

Bow: Breathing operand windows to exploit bypassing in gpus

HA Esfeden, A Abdolrashidi, S Rahman… - 2020 53rd Annual …, 2020 - ieeexplore.ieee.org
The Register File (RF) is a critical structure in Graphics Processing Units (GPUs) responsible
for a large portion of the area and power. To simplify the architecture of the RF, it is …

[HTML][HTML] LSTM-CRP: Algorithm-Hardware Co-Design and Implementation of Cache Replacement Policy Using Long Short-Term Memory

Y Wang, Y Meng, J Wang, C Yang - Big Data and Cognitive Computing, 2024 - mdpi.com
As deep learning has produced dramatic breakthroughs in many areas, it has motivated
emerging studies on the combination between neural networks and cache replacement …

A survey on recent hardware data prefetching approaches with an emphasis on servers

M Bakhshalipour, M Shakerinava, F Golshan… - arxiv preprint arxiv …, 2020 - arxiv.org
Data prefetching, ie, the act of predicting application's future memory accesses and fetching
those that are not in the on-chip caches, is a well-known and widely-used approach to hide …