[HTML][HTML] Deep neural networks in the cloud: Review, applications, challenges and research directions

KY Chan, B Abu-Salih, R Qaddoura, AZ Ala'M… - Neurocomputing, 2023 - Elsevier
Deep neural networks (DNNs) are currently being deployed as machine learning technology
in a wide range of important real-world applications. DNNs consist of a huge number of …

Asynchronous online federated learning for edge devices with non-iid data

Y Chen, Y Ning, M Slawski… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
Federated learning (FL) is a machine learning paradigm where a shared central model is
learned across distributed devices while the training data remains on these devices …

A large-scale analysis of hundreds of in-memory key-value cache clusters at twitter

J Yang, Y Yue, KV Rashmi - ACM Transactions on Storage (TOS), 2021 - dl.acm.org
Modern web services use in-memory caching extensively to increase throughput and reduce
latency. There have been several workload analyses of production systems that have fueled …

Lessons learned from the chameleon testbed

K Keahey, J Anderson, Z Zhen, P Riteau… - 2020 USENIX annual …, 2020 - usenix.org
The Chameleon testbed is a case study in adapting the cloud paradigm for computer
science research. In this paper, we explain how this adaptation was achieved, evaluate it …

{AIFM}:{High-Performance},{Application-Integrated} far memory

Z Ruan, M Schwarzkopf, MK Aguilera… - 14th USENIX Symposium …, 2020 - usenix.org
Memory is the most contended and least elastic resource in datacenter servers today.
Applications can use only local memory—which may be scarce—even though memory …

Caladan: Mitigating interference at microsecond timescales

J Fried, Z Ruan, A Ousterhout, A Belay - 14th USENIX Symposium on …, 2020 - usenix.org
The conventional wisdom is that CPU resources such as cores, caches, and memory
bandwidth must be partitioned to achieve performance isolation between tasks. Both the …

Rethinking software runtimes for disaggregated memory

I Calciu, MT Imran, I Puddu, S Kashyap… - Proceedings of the 26th …, 2021 - dl.acm.org
Disaggregated memory can address resource provisioning inefficiencies in current
datacenters. Multiple software runtimes for disaggregated memory have been proposed in …

Atoll: A scalable low-latency serverless platform

A Singhvi, A Balasubramanian, K Houck… - Proceedings of the …, 2021 - dl.acm.org
With user-facing apps adopting serverless computing, good latency performance of
serverless platforms has become a strong fundamental requirement. However, it is difficult to …

Practical gan-based synthetic ip header trace generation using netshare

Y Yin, Z Lin, M **, G Fanti, V Sekar - Proceedings of the ACM SIGCOMM …, 2022 - dl.acm.org
We explore the feasibility of using Generative Adversarial Networks (GANs) to automatically
learn generative models to generate synthetic packet-and flow header traces for networking …

FIFO queues are all you need for cache eviction

J Yang, Y Zhang, Z Qiu, Y Yue, R Vinayak - Proceedings of the 29th …, 2023 - dl.acm.org
As a cache eviction algorithm, FIFO has a lot of attractive properties, such as simplicity,
speed, scalability, and flash-friendliness. The most prominent criticism of FIFO is its low …