A mess of memory system benchmarking, simulation and application profiling
The Memory stress (Mess) framework provides a unified view of the memory system
benchmarking, simulation and application profiling. The Mess benchmark provides a holistic …
benchmarking, simulation and application profiling. The Mess benchmark provides a holistic …
Don't forget the I/O when allocating your LLC
In modern server CPUs, last-level cache (LLC) is a critical hardware resource that exerts
significant influence on the performance of the workloads, and how to manage LLC is a key …
significant influence on the performance of the workloads, and how to manage LLC is a key …
Quantifying memory underutilization in hpc systems and using it to improve performance via architecture support
A system's memory size is often dictated by worst-case workloads with highest memory
requirements; this causes memory to be underutilized in the common case when the system …
requirements; this causes memory to be underutilized in the common case when the system …
Measuring the impact of memory errors on application performance
Memory reliability is a key factor in the design of warehouse-scale computers. Prior work has
focused on the performance overheads of memory fault-tolerance schemes when errors do …
focused on the performance overheads of memory fault-tolerance schemes when errors do …
Hopscotch: a micro-benchmark suite for memory performance evaluation
Due to the ever-increasing gap between the speed of processing elements and the speed at
which memory systems can feed them with data, current computing systems are often …
which memory systems can feed them with data, current computing systems are often …
Performance analysis and optimization with little's law
S Mehta - 2022 IEEE International Symposium on Performance …, 2022 - ieeexplore.ieee.org
Performance tools are the bridge between processor architecture and a user. However, with
the increasingly complex processor architectures, it is becoming increasingly difficult for the …
the increasingly complex processor architectures, it is becoming increasingly difficult for the …
Multiple physical map**s: Dynamic DRAM channel sharing and partitioning
When an OS allocates memory to a process, it implicitly performs long-term scheduling on
DRAM resources such as channels and banks: Each mapped page frame allows memory …
DRAM resources such as channels and banks: Each mapped page frame allows memory …
Patching up network data leaks with sweeper
Datacenters have witnessed a staggering evolution in networking technologies, driven by
insatiable application demands for larger datasets and inter-server data transfers. Modern …
insatiable application demands for larger datasets and inter-server data transfers. Modern …
An energy-balanced clustering routing algorithm for wireless sensor networks
Y Lan, C **uli, W Meng - 2009 WRI World Congress on …, 2009 - ieeexplore.ieee.org
When cluster heads transmit their data to the sink via multi-hop communication, the cluster
heads closer to the sink not only gather the data in its cluster but also forward the data for …
heads closer to the sink not only gather the data in its cluster but also forward the data for …
A Quantitative Analysis and Guidelines of Data Streaming Accelerator in Modern Intel Xeon Scalable Processors
As semiconductor power density is no longer constant with the technology process scaling
down, we need different solutions if we are to continue scaling application performance. To …
down, we need different solutions if we are to continue scaling application performance. To …