Serverless computing: state-of-the-art, challenges and opportunities

Y Li, Y Lin, Y Wang, K Ye, C Xu - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Serverless computing is growing in popularity by virtue of its lightweight and simplicity of
management. It achieves these merits by reducing the granularity of the computing unit to …

The serverless computing survey: A technical primer for design architecture

Z Li, L Guo, J Cheng, Q Chen, BS He… - ACM Computing Surveys …, 2022 - dl.acm.org
The development of cloud infrastructures inspires the emergence of cloud-native computing.
As the most promising architecture for deploying microservices, serverless computing has …

{INFaaS}: Automated model-less inference serving

F Romero, Q Li, NJ Yadwadkar… - 2021 USENIX Annual …, 2021 - usenix.org
Despite existing work in machine learning inference serving, ease-of-use and cost efficiency
remain challenges at large scales. Developers must manually search through thousands of …

{SkyPilot}: An intercloud broker for sky computing

Z Yang, Z Wu, M Luo, WL Chiang, R Bhardwaj… - … USENIX Symposium on …, 2023 - usenix.org
To comply with the increasing number of government regulations about data placement and
processing, and to protect themselves against major cloud outages, many users want the …

Autopilot: workload autoscaling at google

K Rzadca, P Findeisen, J Swiderski, P Zych… - Proceedings of the …, 2020 - dl.acm.org
In many public and private Cloud systems, users need to specify a limit for the amount of
resources (CPU cores and RAM) to provision for their workloads. A job that exceeds its limits …

Chameleon: scalable adaptation of video analytics

J Jiang, G Ananthanarayanan, P Bodik, S Sen… - Proceedings of the …, 2018 - dl.acm.org
Applying deep convolutional neural networks (NN) to video data at scale poses a substantial
systems challenge, as improving inference accuracy often requires a prohibitive cost in …

Optimus: an efficient dynamic resource scheduler for deep learning clusters

Y Peng, Y Bao, Y Chen, C Wu, C Guo - Proceedings of the Thirteenth …, 2018 - dl.acm.org
Deep learning workloads are common in today's production clusters due to the proliferation
of deep learning driven AI services (eg, speech recognition, machine translation). A deep …

Pocket: Elastic ephemeral storage for serverless analytics

A Klimovic, Y Wang, P Stuedi, A Trivedi… - … USENIX Symposium on …, 2018 - usenix.org
Serverless computing is becoming increasingly popular, enabling users to quickly launch
thousands of shortlived tasks in the cloud with high elasticity and fine-grain billing. These …

Machine learning for networking: Workflow, advances and opportunities

M Wang, Y Cui, X Wang, S **ao, J Jiang - Ieee Network, 2017 - ieeexplore.ieee.org
Recently, machine learning has been used in every possible field to leverage its amazing
power. For a long time, the networking and distributed computing system is the key …

A generic communication scheduler for distributed DNN training acceleration

Y Peng, Y Zhu, Y Chen, Y Bao, B Yi, C Lan… - Proceedings of the 27th …, 2019 - dl.acm.org
We present ByteScheduler, a generic communication scheduler for distributed DNN training
acceleration. ByteScheduler is based on our principled analysis that partitioning and …