- Academic Search

GF Oliveira, J Gómez-Luna, L Orosa, S Ghose… - IEEE …, 2021 - ieeexplore.ieee.org

Data movement between the CPU and main memory is a first-order obstacle against improving
performance, scalability, and energy efficiency in modern systems. Computer systems …

Cited by 108 - All 10 versions

[PDF] arxiv.org

Towards efficient sparse matrix vector multiplication on real processing-in-memory architectures

C Giannoula, I Fernandez, J Gómez-Luna… - ACM SIGMETRICS …, 2022 - dl.acm.org

Several manufacturers have already started to commercialize near-bank Processing-In-Memory
(PIM) architectures, after decades of research efforts. Near-bank PIM architectures …

Cited by 45 - All 10 versions

Advancements in accelerating deep neural network inference on aiot devices: A survey

L Cheng, Y Gu, Q Liu, L Yang, C Liu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

The amalgamation of artificial intelligence with Internet of Things (AIoT) devices have seen a
rapid surge in growth, largely due to the effective implementation of deep neural network …

Cited by 21 - All 4 versions

[PDF] washington.edu

RAMBDA: RDMA-driven Acceleration Framework for Memory-intensive µs-scale Datacenter Applications

Y Yuan, J Huang, Y Sun, T Wang… - … Symposium on High …, 2023 - ieeexplore.ieee.org

Responding to the "datacenter tax" and "killer microseconds" problems for memory-intensive
datacenter applications, diverse solutions including Smart NIC-based ones have been …

Cited by 26 - All 6 versions

[PDF] mdpi.com

A survey of resource management for processing-in-memory and near-memory processing architectures

K Khan, S Pasricha, RG Kim - Journal of Low Power Electronics and …, 2020 - mdpi.com

Due to the amount of data involved in emerging deep learning and big data applications,
operations related to data movement have quickly become a bottleneck. Data-centric …

Cited by 25 - All 9 versions

[PDF] cam.ac.uk

Decoupled vector runahead

A Naithani, J Roelandts, S Ainsworth… - Proceedings of the 56th …, 2023 - dl.acm.org

We present Decoupled Vector Runahead (DVR), an in-core prefetching technique,
executing separately to the main application thread, that exploits massive amounts of …

Cited by 13 - All 9 versions

[PDF] ieee.org

Casper: Accelerating stencil computations using near-cache processing

A Denzler, GF Oliveira, N Hajinazar, R Bera… - IEEE …, 2023 - ieeexplore.ieee.org

Stencil computations are commonly used in a wide variety of scientific applications, ranging
from large-scale weather prediction to solving partial differential equations. Stencil …

Cited by 43 - All 5 versions

[PDF] tsinghua.edu.cn

NDPBridge: Enabling Cross-Bank Coordination in Near-DRAM-Bank Processing Architectures

B Tian, Y Li, L Jiang, S Cai… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org

Various near-data processing (NDP) designs have been proposed to alleviate the memory
wall challenge for data-intensive applications. Among them, near-DRAM-bank NDP …

Cited by 4 - All 3 versions

[PDF] arxiv.org

Dalorex: A data-local program execution and architecture for memory-bound applications

M Orenes-Vera, E Tureci, D Wentzlaff… - … Symposium on High …, 2023 - ieeexplore.ieee.org

Applications with low data reuse and frequent irregular memory accesses, such as graph or
sparse linear algebra workloads, fail to scale well due to memory bottlenecks and poor core …

Cited by 24 - All 5 versions

[PDF] acm.org

Infinity stream: Portable and programmer-friendly in-/near-memory fusion

Z Wang, C Liu, A Arora, L John… - Proceedings of the 28th …, 2023 - dl.acm.org

In-memory computing with large last-level caches is promising to dramatically alleviate data
movement bottlenecks and expose massive bitline-level parallelization opportunities …

Cited by 14 - All 6 versions

Livia: Data-centric computing throughout the memory hierarchy

DAMOV: A new methodology and benchmark suite for evaluating data movement bottlenecks
