DAMOV: A new methodology and benchmark suite for evaluating data movement bottlenecks

GF Oliveira, J Gómez-Luna, L Orosa, S Ghose… - IEEE …, 2021 - ieeexplore.ieee.org
Data movement between the CPU and main memory is a first-order obstacle against improv
ing performance, scalability, and energy efficiency in modern systems. Computer systems …

Near-memory computing: Past, present, and future

G Singh, L Chelini, S Corda, AJ Awan, S Stuijk… - Microprocessors and …, 2019 - Elsevier
The conventional approach of moving data to the CPU for computation has become a
significant performance bottleneck for emerging scale-out data-intensive applications due to …

At the locus of performance: Quantifying the effects of copious 3D-stacked cache on HPC workloads

J Domke, E Vatai, B Gerofi, Y Kodama… - ACM Transactions on …, 2023 - dl.acm.org
Over the last three decades, innovations in the memory subsystem were primarily targeted at
overcoming the data movement bottleneck. In this paper, we focus on a specific market trend …

Nmpo: Near-memory computing profiling and offloading

S Corda, M Kumaraswamy, AJ Awan… - 2021 24th Euromicro …, 2021 - ieeexplore.ieee.org
Real-world applications are now processing big-data sets, often bottlenecked by the data
movement between the compute units and the main memory. Near-memory computing …

[PDF][PDF] At the Locus of Performance: Quantifying the Effects of Copious 3D-Stacked Cache on HPC Workloads

J Domke, E Vatai, B Gerofi, Y Kodama… - arxiv preprint arxiv …, 2022 - researchgate.net
Over the last three decades, innovations in the memory subsystem were primarily targeted at
overcoming the data movement bottleneck. In this paper, we focus on a specific market trend …

Platform independent software analysis for near memory computing

S Corda, G Singh, AJ Awan, R Jordans… - … on Digital System …, 2019 - ieeexplore.ieee.org
Near-memory Computing (NMC) promises improved performance for the applications that
can exploit the features of emerging memory technologies such as 3D-stacked memory …

Near memory acceleration on high resolution radio astronomy imaging

S Corda, B Veenboer, AJ Awan… - 2020 9th …, 2020 - ieeexplore.ieee.org
Modern radio telescopes like the Square Kilometer Array (SKA) will need to process in real-
time exabytes of radio-astronomical signals to construct a high-resolution map of the sky …

[PDF][PDF] Characterization and Acceleration of High Performance Compute Workloads

S Corda - 2022 - research.tue.nl
Modern big-data workloads have demanding performance requirements. This leads to
compute and memory bottlenecks. These applications comprise, among others, radio …