Failure diagnosis in microservice systems: A comprehensive survey and analysis

S Zhang, S Xia, W Fan, B Shi, X Xiong… - ACM Transactions on …, 2024 - dl.acm.org
Widely adopted for their scalability and flexibility, modern microservice systems present
unique failure diagnosis challenges due to their independent deployment and dynamic …

Trustworthy AI-based Performance Diagnosis Systems for Cloud Applications: A Review

R Xin, J Wang, P Chen, Z Zhao - ACM Computing Surveys, 2025 - dl.acm.org
Performance diagnosis systems are defined as detecting abnormal performance
phenomena and play a crucial role in cloud applications. An effective performance …

Zoom-inRCL: Fine-grained root cause localization for B5G/6G network slicing

Y Tan, J Liu, J Wang - Computer Networks, 2025 - Elsevier
Network slicing, a cornerstone technology for the evolving B5G/6G, comprises the Network
Function Virtualization Infrastructure (NFVI) layer, the network slice instance layer, the …

PatternRCA: A pattern-aware root cause analysis framework for multi-dimensional time series

C He, F Tian, P Xue, Y Wu, Y Li, J Li… - … Conference on Data …, 2023 - ieeexplore.ieee.org
Root cause analysis for multi-dimensional time series from large scale micro-service
scenarios aims at identifying the set of anomaly attributes by monitoring operational metrics …

Multi-source KPIs' root cause localization in online service systems

H Xia, J Xu, B Xiao, H Jia, C Gao… - … on Networking and …, 2024 - ieeexplore.ieee.org
Root cause localization is challenging because of the large number of monitoring metrics
and the many types of faults in an online service system extended by a microservices …

[BOOK][B] Towards effective performance diagnosis for distributed applications

R Xin - 2023 - core.ac.uk
Cloud computing provides elastic and on-demand resources for customizing data storage,
processing, and communication, transforming how software applications are developed …