A modern primer on processing in memory

O Mutlu, S Ghose, J Gómez-Luna… - … computing: from devices …, 2022 - Springer
Modern computing systems are overwhelmingly designed to move data to computation. This
design choice goes directly against at least three key trends in computing that cause …

Managing Memory Tiers with {CXL} in Virtualized Environments

Y Zhong, DS Berger, C Waldspurger, R Wee… - … USENIX Symposium on …, 2024 - usenix.org
Cloud providers seek to deploy CXL-based memory to increase aggregate memory
capacity, reduce costs, and lower carbon emissions. However, CXL accesses incur higher …

Detecting and mitigating data-dependent DRAM failures by exploiting current memory content

S Khan, C Wilkerson, Z Wang, AR Alameldeen… - Proceedings of the 50th …, 2017 - dl.acm.org
DRAM cells in close proximity can fail depending on the data content in neighboring cells.
These failures are called data-dependent failures. Detecting and mitigating these failures …

Utility-based hybrid memory management

Y Li, S Ghose, J Choi, J Sun, H Wang… - … Conference on Cluster …, 2017 - ieeexplore.ieee.org
While the memory footprints of cloud and HPC applications continue to increase,
fundamental issues with DRAM scaling are likely to prevent traditional main memory …

Mask: Redesigning the gpu memory hierarchy to support multi-application concurrency

R Ausavarungnirun, V Miller, J Landgraf… - ACM SIGPLAN …, 2018 - dl.acm.org
Graphics Processing Units (GPUs) exploit large amounts of threadlevel parallelism to
provide high instruction throughput and to efficiently hide long-latency stalls. The resulting …

Bandwidth-effective dram cache for gpu s with storage-class memory

J Hong, S Cho, G Park, W Yang… - … Symposium on High …, 2024 - ieeexplore.ieee.org
We propose overcoming the memory capacity limitation of GPUs with high-capacity Storage-
Class Memory (SCM) and DRAM cache. By significantly increasing the memory capacity …

Panthera: Holistic memory management for big data processing over hybrid memories

C Wang, H Cui, T Cao, J Zigman, H Volos… - Proceedings of the 40th …, 2019 - dl.acm.org
Modern data-parallel systems such as Spark rely increasingly on in-memory computing that
can significantly improve the efficiency of iterative algorithms. To process real-world …

Understanding and improving the latency of DRAM-based memory systems

KK Chang - 2017 - search.proquest.com
Over the past two decades, the storage capacity and access bandwidth of main memory
have improved tremendously, by 128x and 20x, respectively. These improvements are …

Chameleon: A dynamically reconfigurable heterogeneous memory system

JB Kotra, H Zhang, AR Alameldeen… - 2018 51st Annual …, 2018 - ieeexplore.ieee.org
Modern computing systems and applications have growing demand for memories with
higher bandwidth. This demand can be alleviated using fast, large on-die or die-stacked …

Errors in flash-memory-based solid-state drives: Analysis, mitigation, and recovery

Y Cai, S Ghose, EF Haratsch, Y Luo, O Mutlu - arxiv preprint arxiv …, 2017 - arxiv.org
NAND flash memory is ubiquitous in everyday life today because its capacity has
continuously increased and cost has continuously decreased over decades. This positive …