A Mess of memory system benchmarking, simulation and application profiling

P Esmaili-Dokht, F Sgherzi, VS Girelli… - 2024 57th IEEE/ACM …, 2024 - ieeexplore.ieee.org
The Memory stress (Mess) framework provides a unified view of the memory system
benchmarking, simulation and application profiling. The Mess benchmark provides a holistic …

Don't forget the I/O when allocating your LLC

Y Yuan, M Alian, Y Wang, R Wang… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
In modern server CPUs, last-level cache (LLC) is a critical hardware resource that exerts
significant influence on the performance of the workloads, and how to manage LLC is a key …

Quantifying memory underutilization in HPC systems and using it to improve performance via architecture support

G Panwar, D Zhang, Y Pang, M Dahshan… - Proceedings of the …, 2019 - dl.acm.org
A system's memory size is often dictated by worst-case workloads with highest memory
requirements; this causes memory to be underutilized in the common case when the system …

Measuring the impact of memory errors on application performance

M Gottscho, M Shoaib, S Govindan… - IEEE Computer …, 2016 - ieeexplore.ieee.org
Memory reliability is a key factor in the design of warehouse-scale computers. Prior work has
focused on the performance overheads of memory fault-tolerance schemes when errors do …

Hopscotch: a micro-benchmark suite for memory performance evaluation

A Ahmed, K Skadron - Proceedings of the International Symposium on …, 2019 - dl.acm.org
Due to the ever-increasing gap between the speed of processing elements and the speed at
which memory systems can feed them with data, current computing systems are often …

Performance analysis and optimization with Little's law

S Mehta - 2022 IEEE International Symposium on Performance …, 2022 - ieeexplore.ieee.org
Performance tools are the bridge between processor architecture and a user. However, with
the increasingly complex processor architectures, it is becoming increasingly difficult for the …

Multiple physical mappings: Dynamic DRAM channel sharing and partitioning

M Hillenbrand, M Gottschlag, J Kehne… - Proceedings of the 8th …, 2017 - dl.acm.org
When an OS allocates memory to a process, it implicitly performs long-term scheduling on
DRAM resources such as channels and banks: Each mapped page frame allows memory …

Patching up network data leaks with Sweeper

M Vemmou, A Cho, A Daglis - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
Datacenters have witnessed a staggering evolution in networking technologies, driven by
insatiable application demands for larger datasets and inter-server data transfers. Modern …

An energy-balanced clustering routing algorithm for wireless sensor networks

Y Lan, C Xiuli, W Meng - 2009 WRI World Congress on …, 2009 - ieeexplore.ieee.org
When cluster heads transmit their data to the sink via multi-hop communication, the cluster
heads closer to the sink not only gather the data in its cluster but also forward the data for …

A Quantitative Analysis and Guidelines of Data Streaming Accelerator in Modern Intel Xeon Scalable Processors

R Kuper, I Jeong, Y Yuan, R Wang… - Proceedings of the 29th …, 2024 - dl.acm.org
As semiconductor power density is no longer constant with the technology process scaling
down, we need different solutions if we are to continue scaling application performance. To …