Working set analytics
PJ Denning - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
The working set model for program behavior was invented in 1965. It has stood the test of
time in virtual memory management for over 50 years. It is considered the ideal for …
time in virtual memory management for over 50 years. It is considered the ideal for …
Improving the performance of bagging ensembles for data streams through mini-batching
Often, machine learning applications have to cope with dynamic environments where data
are collected in the form of continuous data streams with potentially infinite length and …
are collected in the form of continuous data streams with potentially infinite length and …
Parallel Loop Locality Analysis for Symbolic Thread Counts
Data movement limits program performance. This bottleneck is more significant in multi-
thread programs but more difficult to analyze, especially for multiple thread counts. For …
thread programs but more difficult to analyze, especially for multiple thread counts. For …
CARL: Compiler assigned reference leasing
Data movement is a common performance bottleneck, and its chief remedy is caching.
Traditional cache management is transparent to the workload: data that should be kept in …
Traditional cache management is transparent to the workload: data that should be kept in …
Blast from the Past: Least Expected Use (LEU) Cache Replacement with Statistical History
Cache replacement policies typically use some form of statistics on past access behavior. As
a common limitation, however, the extent of the history being recorded is limited to either just …
a common limitation, however, the extent of the history being recorded is limited to either just …
CLAM: Compiler lease of cache memory
I Prechtl, B Reber, C Ding, D Patru… - Proceedings of the …, 2020 - dl.acm.org
Traditional caching is transparent to software but cannot utilize program information directly.
With Moore's Law ending and general-purpose processor speed plateauing, there is …
With Moore's Law ending and general-purpose processor speed plateauing, there is …
Balancing performance and energy consumption of bagging ensembles for the classification of data streams in edge computing
In recent years, the Edge Computing (EC) paradigm has emerged as an enabling factor for
develo** technologies like the Internet of Things (IoT) and 5G networks, bridging the gap …
develo** technologies like the Internet of Things (IoT) and 5G networks, bridging the gap …
MemFriend: Understanding Memory Performance with Spatial-Temporal Affinity
Y Suriyakumar, NR Tallent, A Marquez… - Proceedings of the …, 2024 - dl.acm.org
In HPC applications, memory access behavior is one of the main factors affecting
performance. Improving an application's memory access behavior requires studying spatial …
performance. Improving an application's memory access behavior requires studying spatial …
Uniform lease vs. LRU cache: Analysis and evaluation
Lease caching is a new technique that provides greater control of the cache than what is
allowed in conventional caches. The simplest control is uniform lease (UL), which means …
allowed in conventional caches. The simplest control is uniform lease (UL), which means …
MemGaze: Rapid and Effective Load-Level Memory Trace Analysis
A challenge of memory trace analysis is combining detailed analysis and low overhead
measurement. Currently, hardware/software-based analysis of load-level sequences easily …
measurement. Currently, hardware/software-based analysis of load-level sequences easily …