Kubernetes scheduling: Taxonomy, ongoing issues and challenges

C Carrión - ACM Computing Surveys, 2022 - dl.acm.org
Continuous integration enables the development of microservices-based applications using
container virtualization technology. Container orchestration systems such as Kubernetes …

Serverless computing: state-of-the-art, challenges and opportunities

Y Li, Y Lin, Y Wang, K Ye, C Xu - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Serverless computing is growing in popularity by virtue of its lightweight and simplicity of
management. It achieves these merits by reducing the granularity of the computing unit to …

A petavoxel fragment of human cerebral cortex reconstructed at nanoscale resolution

A Shapson-Coe, M Januszewski, DR Berger, A Pope… - Science, 2024 - science.org
To fully understand how the human brain works, knowledge of its structure at high resolution
is needed. Presented here is a computationally intensive reconstruction of the ultrastructure …

Pond: CXL-based memory pooling systems for cloud platforms

H Li, DS Berger, L Hsu, D Ernst, P Zardoshti… - Proceedings of the 28th …, 2023 - dl.acm.org
Public cloud providers seek to meet stringent performance requirements and low hardware
cost. A key driver of performance and cost is main memory. Memory pooling promises to …

Llumnix: Dynamic scheduling for large language model serving

B Sun, Z Huang, H Zhao, W Xiao, X Zhang… - … USENIX Symposium on …, 2024 - usenix.org
Inference serving for large language models (LLMs) is the key to unleashing their potential
in people's daily lives. However, efficient LLM serving remains challenging today because …

SkyPilot: An intercloud broker for sky computing

Z Yang, Z Wu, M Luo, WL Chiang, R Bhardwaj… - … USENIX Symposium on …, 2023 - usenix.org
To comply with the increasing number of government regulations about data placement and
processing, and to protect themselves against major cloud outages, many users want the …

Borg: the next generation

M Tirmazi, A Barker, N Deng, ME Haque… - Proceedings of the …, 2020 - dl.acm.org
This paper analyzes a newly-published trace that covers 8 different Borg [35] clusters for the
month of May 2019. The trace enables researchers to explore how scheduling works in …

Carbon-aware computing for datacenters

A Radovanović, R Koningstein… - … on Power Systems, 2022 - ieeexplore.ieee.org
The amount of CO2 emitted per kilowatt-hour on an electricity grid varies by time of day and
substantially varies by location due to the types of generation. Networked collections of …

Autopilot: workload autoscaling at Google

K Rzadca, P Findeisen, J Swiderski, P Zych… - Proceedings of the …, 2020 - dl.acm.org
In many public and private Cloud systems, users need to specify a limit for the amount of
resources (CPU cores and RAM) to provision for their workloads. A job that exceeds its limits …

An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems

Y Gan, Y Zhang, D Cheng, A Shetty, P Rathi… - Proceedings of the …, 2019 - dl.acm.org
Cloud services have recently started undergoing a major shift from monolithic applications,
to graphs of hundreds or thousands of loosely-coupled microservices. Microservices …