Cache-conscious wavefront scheduling

TG Rogers, M O'Connor… - 2012 45th Annual IEEE …, 2012 - ieeexplore.ieee.org
This paper studies the effects of hardware thread scheduling on cache management in
GPUs. We propose Cache-Conscious Wave front Scheduling (CCWS), an adaptive …

Divergence-aware warp scheduling

TG Rogers, M O'Connor, TM Aamodt - … of the 46th Annual IEEE/ACM …, 2013 - dl.acm.org
This paper uses hardware thread scheduling to improve the performance and energy
efficiency of divergent applications on GPUs. We propose Divergence-Aware Warp …

Low-latency, high-throughput garbage collection

W Zhao, SM Blackburn, KS McKinley - Proceedings of the 43rd ACM …, 2022 - dl.acm.org
To achieve short pauses, state-of-the-art concurrent copying collectors such as C4,
Shenandoah, and ZGC use substantially more CPU cycles and memory than simpler …

Performance analysis of content matching intrusion detection systems

S Antonatos, KG Anagnostakis… - … on Applications and …, 2004 - ieeexplore.ieee.org
Although network intrusion detection systems (nIDS) are widely used, there is limited
understanding of how these systems perform in different settings and how they should be …

GPUs as an opportunity for offloading garbage collection

M Maas, P Reames, J Morlan, K Asanović… - ACM SIGPLAN …, 2012 - dl.acm.org
GPUs have become part of most commodity systems. Nonetheless, they are often
underutilized when not executing graphics-intensive or special-purpose numerical …

Data structure aware garbage collector

N Cohen, E Petrank - Proceedings of the 2015 International Symposium …, 2015 - dl.acm.org
Garbage collection may benefit greatly from knowledge about program behavior, but most
managed languages do not provide means for the programmer to deliver such knowledge …

Memory management for many-core processors with software configurable locality policies

J Zhou, B Demsky - ACM SIGPLAN Notices, 2012 - dl.acm.org
As processors evolve towards higher core counts, architects will develop more sophisticated
memory systems to satisfy the cores' increasing thirst for memory bandwidth. Early many …

On the limits of modeling generational garbage collector performance

P Libič, L Bulej, V Horky, P Tůma - Proceedings of the 5th ACM/SPEC …, 2014 - dl.acm.org
Garbage collection is an element of many contemporary software platforms whose
performance is determined by complex interactions and is therefore difficult to quantify and …

Improving Garbage Collection Observability with Performance Tracing

C Huang, S Blackburn, Z Cai - Proceedings of the 20th ACM SIGPLAN …, 2023 - dl.acm.org
Debugging garbage collectors for performance and correctness is notoriously difficult.
Among the arsenal of tools available to systems engineers, support for one of the most …

Linear-Mark: Locality vs. Accuracy in Mark-Sweep Garbage Collection

C Meiohas, SM Blackburn, E Petrank - Proceedings of the International …, 2023 - dl.acm.org
Tracing garbage collectors are widely deployed in modern programming languages. But
tracing an arbitrary heap shape incurs poor locality and may hinder scalability. In this paper …