Root cause analysis of failures in microservices through causal discovery

A Ikram, S Chakraborty, S Mitra… - Advances in …, 2022 - proceedings.neurips.cc
Most cloud applications use a large number of smaller sub-components (called
microservices) that interact with each other in the form of a complex graph to provide the …

Adaptive resource efficient microservice deployment in cloud-edge continuum

K Fu, W Zhang, Q Chen, D Zeng… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
User-facing services are now evolving towards the microservice architecture where a
service is built by connecting multiple microservice stages. Since the entire service is heavy …

The power of prediction: microservice auto scaling via workload learning

S Luo, H Xu, K Ye, G Xu, L Zhang, G Yang… - Proceedings of the 13th …, 2022 - dl.acm.org
When deploying microservices in production clusters, it is critical to automatically scale
containers to improve cluster utilization and ensure service level agreements (SLA) …

An in-depth study of microservice call graph and runtime performance

S Luo, H Xu, C Lu, K Ye, G Xu, L Zhang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Loosely-coupled and light-weight microservices running in containers are replacing
monolithic applications gradually. Understanding the characteristics of microservices is …

Qos-aware and resource efficient microservice deployment in cloud-edge continuum

K Fu, W Zhang, Q Chen, D Zeng, X Peng… - 2021 IEEE …, 2021 - ieeexplore.ieee.org
User-facing services are now evolving towards the microservice architecture where a
service is built by connecting multiple microservice stages. While an entire service is heavy …

Erms: Efficient resource management for shared microservices with sla guarantees

S Luo, H Xu, K Ye, G Xu, L Zhang, J He… - Proceedings of the 28th …, 2022 - dl.acm.org
A common approach to improving resource utilization in data centers is to adaptively
provision resources based on the actual workload. One fundamental challenge of doing this …

Understanding, predicting and scheduling serverless workloads under partial interference

L Zhao, Y Yang, Y Li, X Zhou, K Li - Proceedings of the International …, 2021 - dl.acm.org
Interference among distributed cloud applications can be classified into three types: full,
partial and zero. While prior research merely focused on full interference, the partial …

Take it to the limit: peak prediction-driven resource overcommitment in datacenters

N Bashir, N Deng, K Rzadca, D Irwin, S Kodak… - Proceedings of the …, 2021 - dl.acm.org
To increase utilization, datacenter schedulers often overcommit resources where the sum of
resources allocated to the tasks on a machine exceeds its physical capacity. Setting the right …

Containerized microservices: A survey of resource management frameworks

LM Al Qassem, T Stouraitis, E Damiani… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
The growing adoption of microservice architectures (MSAs) has led to major research and
development efforts to address their challenges and improve their performance, reliability …

Delay-aware optimization of fine-grained microservice deployment and routing in edge via reinforcement learning

K Peng, J He, J Guo, Y Liu, J He… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Microservices have exerted a profound impact on the development of internet applications.
Meanwhile, the growing number of mobile terminal user requests has made the …