Analysis and approximation of optimal co-scheduling on chip multiprocessors

Y Jiang, X Shen, J Chen, R Tripathi - Proceedings of the 17th …, 2008 - dl.acm.org
Cache sharing among processors is important for Chip Multiprocessors to reduce
inter-thread latency, but also brings cache contention, degrading program performance …

HOTL: a higher order theory of locality

X Xiang, C Ding, H Luo, B Bao - Proceedings of the eighteenth …, 2013 - dl.acm.org
The locality metrics are many, for example, miss ratio to test performance, data footprint to
manage cache sharing, and reuse distance to analyze and optimize a program. It is unclear …

Kinetic modeling of data eviction in cache

X Hu, X Wang, L Zhou, Y Luo, C Ding… - 2016 USENIX Annual …, 2016 - usenix.org
The reuse distance (LRU stack distance) is an essential metric for performance prediction
and optimization of storage and CPU cache. Over the last four decades, there have been …

Accelerating multicore reuse distance analysis with sampling and parallelization

DL Schuff, M Kulkarni, VS Pai - … of the 19th international conference on …, 2010 - dl.acm.org
Reuse distance analysis is a well-established tool for predicting cache performance, driving
compiler optimizations, and assisting visualization and manual optimization of programs …

PARDA: A fast parallel reuse distance analysis algorithm

Q Niu, J Dinan, Q Lu… - 2012 IEEE 26th …, 2012 - ieeexplore.ieee.org
Reuse distance is a well established approach to characterizing data cache locality based
on the stack histogram model. This analysis so far has been restricted to offline use due to …

Linear-time modeling of program working set in shared cache

X Xiang, B Bao, C Ding, Y Gao - 2011 International Conference …, 2011 - ieeexplore.ieee.org
Many techniques characterize the program working set by the notion of the program
footprint, which is the volume of data accessed in a time window. A complete …

Fast miss ratio curve modeling for storage cache

X Hu, X Wang, L Zhou, Y Luo, Z Wang… - ACM Transactions on …, 2018 - dl.acm.org
The reuse distance (least recently used (LRU) stack distance) is an essential metric for
performance prediction and optimization of storage cache. Over the past four decades, there …

Optimal cache partition-sharing

J Brock, C Ye, C Ding, Y Li, X Wang… - 2015 44th International …, 2015 - ieeexplore.ieee.org
When a cache is shared by multiple cores, its space may be allocated either by sharing,
partitioning, or both. We call the last case partition-sharing. This paper studies partition …

ReuseTracker: Fast Yet Accurate Multicore Reuse Distance Analyzer

MA Sasongko, M Chabbi, MB Marzijarani… - ACM Transactions on …, 2021 - dl.acm.org
One widely used metric that measures data locality is reuse distance—the number of unique
memory locations that are accessed between two consecutive accesses to a particular …

Locality analysis of graph reordering algorithms

MK Esfahani, P Kilpatrick… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
A major challenge in processing real-world graphs stems from poor locality of memory
accesses and vertex reordering algorithms (RAs) have been proposed to improve locality by …