Академия Google

R Bera, K Kanellopoulos, A Nori, T Shahroodi… - MICRO-54: 54th Annual …, 2021 - dl.acm.org

Past research has proposed numerous hardware prefetching techniques, most of which rely
on exploiting one specific type of program context information (eg, program counter …

Сохранить Цитировать Цитируется: 91 Похожие статьи Все версии статьи (7)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

The championship simulator: Architectural simulation for education and competition

N Gober, G Chacon, L Wang, PV Gratz… - arxiv preprint arxiv …, 2022 - arxiv.org

Recent years have seen a dramatic increase in the microarchitectural complexity of
processors. This increase in complexity presents a twofold challenge for the field of …

Сохранить Цитировать Цитируется: 59 Похожие статьи Все версии статьи (2) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] cam.ac.uk

Decoupled vector runahead

A Naithani, J Roelandts, S Ainsworth… - Proceedings of the 56th …, 2023 - dl.acm.org

We present Decoupled Vector Runahead (DVR), an in-core prefetching technique,
executing separately to the main application thread, that exploits massive amounts of …

Сохранить Цитировать Цитируется: 13 Похожие статьи Все версии статьи (9)

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

AfterImage: Leaking control flow data and tracking load operations via the hardware prefetcher

Y Chen, L Pei, TE Carlson - Proceedings of the 28th ACM International …, 2023 - dl.acm.org

Research into processor-based side-channels has seen both a large number and a large
variety of disclosed vulnerabilities that can leak critical, private data to malicious attackers …

Сохранить Цитировать Цитируется: 24 Похожие статьи Все версии статьи (3)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Hermes: Accelerating long-latency load requests via perceptron-based off-chip load prediction

R Bera, K Kanellopoulos… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org

Long-latency load requests continue to limit the performance of modern high-performance
processors. To increase the latency tolerance of a processor, architects have primarily relied …

Сохранить Цитировать Цитируется: 27 Похожие статьи Все версии статьи (7)

[Free GPT-4]
[DeepSeek]

[PDF] iitb.ac.in

Clip: Load criticality based data prefetching for bandwidth-constrained many-core systems

B Panda - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org

Hardware prefetching is a latency-hiding technique that hides the costly off-chip DRAM
accesses. However, state-of-the-art prefetchers fail to deliver performance improvement in …

Сохранить Цитировать Цитируется: 11 Похожие статьи Все версии статьи (5)

[Free GPT-4]
[DeepSeek]

[PDF] nsf.gov

Effective mimicry of belady's min policy

I Shah, A Jain, C Lin - 2022 IEEE International Symposium on …, 2022 - ieeexplore.ieee.org

The past decade has seen the rise of highly successful cache replacement policies that are
based on binary prediction. For example, the Hawkeye policy learns whether lines loaded …

Сохранить Цитировать Цитируется: 34 Похожие статьи Все версии статьи (8)

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Micro-armed bandit: lightweight & reusable reinforcement learning for microarchitecture decision-making

G Gerogiannis, J Torrellas - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org

Online Reinforcement Learning (RL) has been adopted as an effective mechanism in
various decision-making problems in microarchitecture. Its high adaptability and the ability to …

Сохранить Цитировать Цитируется: 9 Похожие статьи Все версии статьи (5)

[Free GPT-4]
[DeepSeek]

[PDF] um.es

Berti: an accurate local-delta data prefetcher

A Navarro-Torres, B Panda… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org

Data prefetching is a technique that plays a crucial role in modern high-performance
processors by hiding long latency memory accesses. Several state-of-the-art hardware …

Сохранить Цитировать Цитируется: 41 Похожие статьи Все версии статьи (10)

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

Snake: A variable-length chain-based prefetching for gpus

S Mostofi, H Falahati, N Mahani… - Proceedings of the 56th …, 2023 - dl.acm.org

Graphics Processing Units (GPUs) utilize memory hierarchy and Thread-Level Parallelism
(TLP) to tolerate off-chip memory latency, which is a significant bottleneck for memory-bound …

Сохранить Цитировать Цитируется: 5 Похожие статьи Все версии статьи (4)

Создать оповещение

Цитировать

Расширенный поиск

Сохранено в вашей библиотеке

Bouquet of instruction pointers: Instruction pointer classifier-based spatial hardware prefetching

Pythia: A customizable hardware prefetching framework using online reinforcement learning

The championship simulator: Architectural simulation for education and competition

Decoupled vector runahead

AfterImage: Leaking control flow data and tracking load operations via the hardware prefetcher

Hermes: Accelerating long-latency load requests via perceptron-based off-chip load prediction

Clip: Load criticality based data prefetching for bandwidth-constrained many-core systems

Effective mimicry of belady's min policy

Micro-armed bandit: lightweight & reusable reinforcement learning for microarchitecture decision-making

Berti: an accurate local-delta data prefetcher

Snake: A variable-length chain-based prefetching for gpus