An overview of analysis methods and evaluation results for caching strategies

G Hasslinger, M Okhovatzadeh, K Ntougias… - Computer Networks, 2023 - Elsevier
We survey analytical methods and evaluation results for the performance assessment of
caching strategies. Knapsack solutions are derived, which provide static caching bounds for …

Efficient computation of optimal thresholds in cloud auto-scaling systems

T Tournaire, H Castel-Taleb, E Hyon - ACM Transactions on Modeling …, 2023 - dl.acm.org
We consider a horizontal and dynamic auto-scaling technique in a cloud system where
virtual machines hosted on a physical node are turned on and off to minimise energy …

Time-to-live caching with network delays: Exact analysis and computable approximations

K Elsayed, A Rizk - IEEE/ACM Transactions on Networking, 2022 - ieeexplore.ieee.org
We consider Time-to-Live (TTL) caches that tag every object in cache with a specific (and
possibly renewable) expiration time. State-of-the-art models for TTL caches assume zero …

Optimal edge caching for individualized demand dynamics

G Quan, A Eryilmaz, NB Shroff - IEEE/ACM Transactions on …, 2024 - ieeexplore.ieee.org
The ever-growing end user data demands, and the reductions in memory costs are fueling
edge-caching deployments. Caching at the edge is substantially different from that at the …

Introducing Super RAGs in Mistral 8x7B-v1

A Thakur, R Gupta - arXiv preprint arXiv:2404.08940, 2024 - arxiv.org
The relentless pursuit of enhancing Large Language Models (LLMs) has led to the advent of
Super Retrieval-Augmented Generation (Super RAGs), a novel approach designed to …