Adaptive QoS-aware microservice deployment with excessive loads via intra-and inter-datacenter scheduling

J Shi, K Fu, J Wang, Q Chen, D Zeng… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
User-facing applications often experience excessive loads and are shifting towards the
microservice architecture. To fully utilize heterogeneous resources, current datacenters have …

Nodens: Enabling Resource Efficient and Fast {QoS} Recovery of Dynamic Microservice Applications in Datacenters

J Shi, H Zhang, Z Tong, Q Chen, K Fu… - 2023 USENIX Annual …, 2023 - usenix.org
Current microservice applications always meet with load and call graph dynamics. These
dynamics can easily lead to inappropriate resource allocation for microservices, and further …

Serverless Cold Starts and Where to Find Them

A Joosen, A Hassan, M Asenov, R Singh… - arxiv preprint arxiv …, 2024 - arxiv.org
This paper releases and analyzes a month-long trace of 85 billion user requests and 11.9
million cold starts from Huawei's serverless cloud platform. Our analysis spans workloads …

Expeditious {High-Concurrency}{MicroVM}{SnapStart} in Persistent Memory with an Augmented Hypervisor

X Pang, Y Zhang, L Liu, D Cheng, C Xu… - 2024 USENIX Annual …, 2024 - usenix.org
The industry has embraced snapshotting to tackle the cold starts and efficiently manage
numerous short-lived functions for microservice-native architectures, serverless computing …

Topology-Aware Self-Adaptive Resource Provisioning for Microservices

H Zeng, T Wang, A Li, Y Wu, H Wu… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Microservice architecture is a popular technology for deploying services in cloud computing,
with benefits like loose coupling, high fault tolerance, and scalability. The heterogeneous …

FaaSGraph: Enabling Scalable, Efficient, and Cost-Effective Graph Processing with Serverless Computing

Y Liu, S Sun, Z Li, Q Chen, S Gao, B He, C Li… - Proceedings of the 29th …, 2024 - dl.acm.org
Graph processing is widely used in cloud services; however, current frameworks face
challenges in efficiency and cost-effectiveness when deployed under the Infrastructure-as-a …

Towards Cloud Efficiency with Large-scale Workload Characterization

A Parayil, J Zhang, X Qin, Í Goiri, L Huang… - arxiv preprint arxiv …, 2024 - arxiv.org
Cloud providers introduce features (eg, Spot VMs, Harvest VMs, and Burstable VMs) and
optimizations (eg, oversubscription, auto-scaling, power harvesting, and overclocking) to …

Workload Intelligence: Punching Holes Through the Cloud Abstraction

L Huang, A Parayil, J Zhang, X Qin, C Bansal… - arxiv preprint arxiv …, 2024 - arxiv.org
Today, cloud workloads are essentially opaque to the cloud platform. Typically, the only
information the platform receives is the virtual machine (VM) type and possibly a decoration …

Learning to Score: Tuning Cluster Schedulers through Reinforcement Learning

M Asenov, Q Deng, G Yeung… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Efficiently allocating incoming jobs to nodes in large-scale clusters can lead to substantial
improvements in both cluster utilization and job performance. In order to allocate incoming …

A Microservice Graph Generator with Production Characteristics

F Du, J Shi, Q Chen, L Li, M Guo - arxiv preprint arxiv:2412.19083, 2024 - arxiv.org
A production microservice application may provide multiple services, queries of a service
may have different call graphs, and a microservice may be shared across call graphs. It is …