A modern primer on processing in memory

O Mutlu, S Ghose, J Gómez-Luna… - … computing: from devices …, 2022‏ - Springer
Modern computing systems are overwhelmingly designed to move data to computation. This
design choice goes directly against at least three key trends in computing that cause …

Processing data where it makes sense: Enabling in-memory computation

O Mutlu, S Ghose, J Gómez-Luna… - Microprocessors and …, 2019‏ - Elsevier
Today's systems are overwhelmingly designed to move data to computation. This design
choice goes directly against at least three key trends in systems that cause performance …

Near-memory computing: Past, present, and future

G Singh, L Chelini, S Corda, AJ Awan, S Stuijk… - Microprocessors and …, 2019‏ - Elsevier
The conventional approach of moving data to the CPU for computation has become a
significant performance bottleneck for emerging scale-out data-intensive applications due to …

Syncron: Efficient synchronization support for near-data-processing architectures

C Giannoula, N Vijaykumar… - … Symposium on High …, 2021‏ - ieeexplore.ieee.org
Near-Data-Processing (NDP) architectures present a promising way to alleviate data
movement costs and can provide significant performance and energy benefits to parallel …

Performance optimization for edge-cloud serverless platforms via dynamic task placement

A Das, S Imai, S Patterson… - 2020 20th IEEE/ACM …, 2020‏ - ieeexplore.ieee.org
We present a framework for performance optimization in serverless edge-cloud platforms
using dynamic task placement. We focus on applications for smart edge devices, for …

Understanding and improving the latency of DRAM-based memory systems

KK Chang - 2017‏ - search.proquest.com
Over the past two decades, the storage capacity and access bandwidth of main memory
have improved tremendously, by 128x and 20x, respectively. These improvements are …

Revisiting the design of data stream processing systems on multi-core processors

S Zhang, B He, D Dahlmeier, AC Zhou… - 2017 IEEE 33rd …, 2017‏ - ieeexplore.ieee.org
Driven by the rapidly increasing demand for handling real-time data streams, many data
stream processing (DSP) systems have been proposed. Regardless of the different …

Runtime object lifetime profiler for latency sensitive big data applications

R Bruno, D Patricio, J Simão, L Veiga… - Proceedings of the …, 2019‏ - dl.acm.org
Latency sensitive services such as credit-card fraud detection and website targeted
advertisement rely on Big Data platforms which run on top of memory managed runtimes …

Improving spark application throughput via memory aware task co-location: A mixture of experts approach

VS Marco, B Taylor, B Porter, Z Wang - … of the 18th ACM/IFIP/USENIX …, 2017‏ - dl.acm.org
Data analytic applications built upon big data processing frameworks such as Apache Spark
are an important class of applications. Many of these applications are not latency-sensitive …

BDEv 3.0: energy efficiency and microarchitectural characterization of Big Data processing frameworks

J Veiga, J Enes, RR Expósito, J Tourino - Future Generation Computer …, 2018‏ - Elsevier
As the size of Big Data workloads keeps increasing, the evaluation of distributed frameworks
becomes a crucial task in order to identify potential performance bottlenecks that may delay …