An introduction to the compute express link (cxl) interconnect

D Das Sharma, R Blankenship, D Berger - ACM Computing Surveys, 2024 - dl.acm.org
The Compute Express Link (CXL) is an open industry-standard interconnect between
processors and devices such as accelerators, memory buffers, smart network interfaces …

Fifo queues are all you need for cache eviction

J Yang, Y Zhang, Z Qiu, Y Yue, R Vinayak - Proceedings of the 29th …, 2023 - dl.acm.org
As a cache eviction algorithm, FIFO has a lot of attractive properties, such as simplicity,
speed, scalability, and flash-friendliness. The most prominent criticism of FIFO is its low …

{SIEVE} is simpler than {LRU}: an efficient {Turn-Key} eviction algorithm for web caches

Y Zhang, J Yang, Y Yue, Y Vigfusson… - 21st USENIX Symposium …, 2024 - usenix.org
Caching is an indispensable technique for low-cost and fast data serving. The eviction
algorithm, at the heart of a cache, has been primarily designed to maximize efficiency …

FIFO can be Better than LRU: the Power of Lazy Promotion and Quick Demotion

J Yang, Z Qiu, Y Zhang, Y Yue, KV Rashmi - … of the 19th Workshop on Hot …, 2023 - dl.acm.org
LRU has been the basis of cache eviction algorithms for decades, with a plethora of
innovations on improving LRU's miss ratio and throughput. While it is well-known that FIFO …

Cocktail edge caching: Ride dynamic trends of content popularity with ensemble learning

T Zong, C Li, Y Lei, G Li, H Cao… - IEEE/ACM Transactions …, 2022 - ieeexplore.ieee.org
Edge caching will play a critical role in facilitating the emerging content-rich applications.
However, it faces many new challenges, in particular, the highly dynamic content popularity …

TTLs matter: Efficient cache sizing with TTL-aware miss ratio curves and working set sizes

S Sultan, K Shakiba, A Lee, P Chen… - Proceedings of the …, 2024 - dl.acm.org
In-memory caches play a pivotal role in optimizing distributed systems by significantly
reducing query response times. Correctly sizing these caches is critical, especially …

Agile cache replacement in edge computing via offline-online deep reinforcement learning

Z Wang, J Hu, G Min, Z Zhao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
One fundamental problem of content caching in edge computing is how to replace contents
in edge servers with limited capacities to meet the dynamic requirements of users without …

Seer: Enabling {Future-Aware} Online Caching in Networked Systems

J Lei, V Shrivastav - 21st USENIX Symposium on Networked Systems …, 2024 - usenix.org
State-intensive network and distributed applications rely heavily on online caching heuristics
for high performance. However, there remains a fundamental performance gap between …

AI-driven proactive content caching for 6G

G Cheng, C Jiang, B Yue, R Wang… - IEEE Wireless …, 2023 - ieeexplore.ieee.org
To address the limitations of the current proactive content caching technology for the 6th
generation (6G) mobile network, this article comprehensively analyzes the complex …

FLOWS: Balanced MRC profiling for heterogeneous object-size cache

X Guo, H Wang, K Zhou, H Jiang, Y Han… - Proceedings of the …, 2024 - dl.acm.org
While Miss Ratio Curve (MRC) profiling methods based on spatial sampling are effective in
modeling cache behaviors, previous MRC studies lack in-depth analysis of profiling errors …