Performance anomaly detection and bottleneck identification

O Ibidunmoye, F Hernández-Rodriguez… - ACM Computing Surveys …, 2015 - dl.acm.org
In order to meet stringent performance requirements, system administrators must effectively
detect undesirable performance behaviours, identify potential root causes, and take …

Tales of the tail: Hardware, os, and application-level sources of tail latency

J Li, NK Sharma, DRK Ports, SD Gribble - Proceedings of the ACM …, 2014 - dl.acm.org
Interactive services often have large-scale parallel implementations. To deliver fast
responses, the median and tail latencies of a service's components must be low. In this …

Automap: Diagnose your microservice-based web applications automatically

M Ma, J Xu, Y Wang, P Chen, Z Zhang… - Proceedings of The Web …, 2020 - dl.acm.org
The high complexity and dynamics of the microservice architecture make its application
diagnosis extremely challenging. Static troubleshooting approaches may fail to obtain …

Performance modeling for cloud microservice applications

A **dal, V Podolskiy, M Gerndt - Proceedings of the 2019 ACM/SPEC …, 2019 - dl.acm.org
Microservices enable a fine-grained control over the cloud applications that they constitute
and thus became widely-used in the industry. Each microservice implements its own …

Ms-rank: Multi-metric and self-adaptive root cause diagnosis for microservice applications

M Ma, W Lin, D Pan, P Wang - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
This paper presents a self-adaptive root cause diagnosis framework, named MS-Rank, to
analyze multiple metrics collected from micro-service architecture. MS-Rank decomposes …

Grunt attack: Exploiting execution dependencies in microservices

X Gu, Q Wang, J Liu, J Wei - 2024 54th Annual IEEE/IFIP …, 2024 - ieeexplore.ieee.org
Loosely-coupled and lightweight microservices running in containers are likely to form
complex execution dependencies inside the system. The execution dependency arises …

PEAS: A performance evaluation framework for auto-scaling strategies in cloud applications

AV Papadopoulos, A Ali-Eldin, KE Årzén… - ACM Transactions on …, 2016 - dl.acm.org
Numerous auto-scaling strategies have been proposed in the past few years for improving
various Quality of Service (QoS) indicators of cloud applications, for example, response time …

Servicerank: Root cause identification of anomaly in large-scale microservice architectures

M Ma, W Lin, D Pan, P Wang - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Nowadays, increasing business applications running in the cloud are embracing the
microservice architecture. This article presents the challenges and implications of …

Q-zilla: A scheduling framework and core microarchitecture for tail-tolerant microservices

A Mirhosseini, BL West, GW Blake… - … Symposium on High …, 2020 - ieeexplore.ieee.org
Managing tail latency is a primary challenge in designing large-scale Internet services.
Queuing is a major contributor to end-to-end tail latency, wherein nominal tasks are …

Software data planes: You can't always spin to win

H Golestani, A Mirhosseini, TF Wenisch - Proceedings of the ACM …, 2019 - dl.acm.org
Today's datacenters demand high-performance, energy-efficient software data planes,
which are widely used in many areas including fast network packet processing, network …