Google Scholar

[HTML] A survey of cache bypassing techniques

S Mittal - Journal of Low Power Electronics and Applications, 2016 - mdpi.com

With increasing core-count, the cache demand of modern processors has also increased.
However, due to strict area/power budgets and presence of poor data-locality workloads …

Cited by 50 - All 9 versions

[BOOK][B] General-purpose graphics processor architectures

TM Aamodt, WWL Fung, TG Rogers, M Martonosi - 2018 - Springer

Originally developed to support video games, graphics processor units (GPUs) are now
increasingly used for general-purpose (non-graphics) applications ranging from machine …

Cited by 107 - All 4 versions

[PDF] acm.org

MASK: Redesigning the GPU memory hierarchy to support multi-application concurrency

R Ausavarungnirun, V Miller, J Landgraf… - ACM SIGPLAN …, 2018 - dl.acm.org

Graphics Processing Units (GPUs) exploit large amounts of thread-level parallelism to
provide high instruction throughput and to efficiently hide long-latency stalls. The resulting …

Cited by 114 - All 26 versions

[PDF] acm.org

Locality-driven dynamic GPU cache bypassing

C Li, SL Song, H Dai, A Sidelnik, SKS Hari… - Proceedings of the 29th …, 2015 - dl.acm.org

This paper presents novel cache optimizations for massively parallel, throughput-oriented
architectures like GPUs. L1 data caches (L1 D-caches) are critical resources for providing …

Cited by 142 - All 6 versions

CAWA: Coordinated warp scheduling and cache prioritization for critical warp acceleration of GPGPU workloads

SY Lee, A Arunkumar, CJ Wu - ACM SIGARCH Computer Architecture …, 2015 - dl.acm.org

The ubiquity of graphics processing unit (GPU) architectures has made them efficient
alternatives to chip-multiprocessors for parallel workloads. GPUs achieve superior …

Cited by 126 - All 6 versions

[PDF] tu-dresden.de

Locality-aware CTA clustering for modern GPUs

A Li, SL Song, W Liu, X Liu, A Kumar… - ACM SIGARCH …, 2017 - dl.acm.org

Cache is designed to exploit locality; however, the role of on-chip L1 data caches on modern
GPUs is often awkward. The locality among global memory requests from different SMs …

Cited by 97 - All 14 versions

FlexMiner: A pattern-aware accelerator for graph pattern mining

X Chen, T Huang, S Xu, T Bourgeat… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org

Graph pattern mining (GPM) is a class of algorithms widely used in many real-world
applications in bio-medicine, e-commerce, security, social sciences, etc. GPM is a …

Cited by 38 - All 3 versions

[PDF] wiley.com Full View

Survey on memory management techniques in heterogeneous computing systems

A Hazarika, S Poddar… - IET Computers & Digital …, 2020 - Wiley Online Library

A major issue faced by data scientists today is how to scale up their processing infrastructure
to meet the challenge of big data and high‐performance computing (HPC) workloads. With …

Cited by 21 - All 6 versions

[PDF] acm.org

Access pattern-aware cache management for improving data utilization in GPU

G Koo, Y Oh, WW Ro, M Annavaram - Proceedings of the 44th annual …, 2017 - dl.acm.org

Long latency of memory operation is a prominent performance bottleneck in graphics
processing units (GPUs). The small data cache that must be shared across dozens of warps …

Cited by 88 - All 8 versions

[PDF] cmu.edu

The locality descriptor: A holistic cross-layer abstraction to express data locality in GPUs

N Vijaykumar, E Ebrahimi, K Hsieh… - 2018 ACM/IEEE 45th …, 2018 - ieeexplore.ieee.org

Exploiting data locality in GPUs is critical to making more efficient use of the existing caches
and the NUMA-based memory hierarchy expected in future GPUs. While modern GPU …

Cited by 79 - All 9 versions

Adaptive cache management for energy-efficient GPU computing

[HTML][HTML] A survey of cache bypassing techniques
