[HTML][HTML] Deep neural networks in the cloud: Review, applications, challenges and research directions
Deep neural networks (DNNs) are currently being deployed as machine learning technology
in a wide range of important real-world applications. DNNs consist of a huge number of …
in a wide range of important real-world applications. DNNs consist of a huge number of …
Asynchronous online federated learning for edge devices with non-iid data
Federated learning (FL) is a machine learning paradigm where a shared central model is
learned across distributed devices while the training data remains on these devices …
learned across distributed devices while the training data remains on these devices …
A large-scale analysis of hundreds of in-memory key-value cache clusters at twitter
Modern web services use in-memory caching extensively to increase throughput and reduce
latency. There have been several workload analyses of production systems that have fueled …
latency. There have been several workload analyses of production systems that have fueled …
Lessons learned from the chameleon testbed
The Chameleon testbed is a case study in adapting the cloud paradigm for computer
science research. In this paper, we explain how this adaptation was achieved, evaluate it …
science research. In this paper, we explain how this adaptation was achieved, evaluate it …
{AIFM}:{High-Performance},{Application-Integrated} far memory
Memory is the most contended and least elastic resource in datacenter servers today.
Applications can use only local memory—which may be scarce—even though memory …
Applications can use only local memory—which may be scarce—even though memory …
Caladan: Mitigating interference at microsecond timescales
The conventional wisdom is that CPU resources such as cores, caches, and memory
bandwidth must be partitioned to achieve performance isolation between tasks. Both the …
bandwidth must be partitioned to achieve performance isolation between tasks. Both the …
Rethinking software runtimes for disaggregated memory
Disaggregated memory can address resource provisioning inefficiencies in current
datacenters. Multiple software runtimes for disaggregated memory have been proposed in …
datacenters. Multiple software runtimes for disaggregated memory have been proposed in …
Atoll: A scalable low-latency serverless platform
With user-facing apps adopting serverless computing, good latency performance of
serverless platforms has become a strong fundamental requirement. However, it is difficult to …
serverless platforms has become a strong fundamental requirement. However, it is difficult to …
Practical gan-based synthetic ip header trace generation using netshare
We explore the feasibility of using Generative Adversarial Networks (GANs) to automatically
learn generative models to generate synthetic packet-and flow header traces for networking …
learn generative models to generate synthetic packet-and flow header traces for networking …
FIFO queues are all you need for cache eviction
As a cache eviction algorithm, FIFO has a lot of attractive properties, such as simplicity,
speed, scalability, and flash-friendliness. The most prominent criticism of FIFO is its low …
speed, scalability, and flash-friendliness. The most prominent criticism of FIFO is its low …