Nezha: Interpretable fine-grained root causes analysis for microservices on multi-modal observability data

G Yu, P Chen, Y Li, H Chen, X Li, Z Zheng - Proceedings of the 31st …, 2023 - dl.acm.org
Root cause analysis (RCA) in large-scale microservice systems is a critical and challenging
task. To understand and localize root causes of unexpected faults, modern observability …

Uac-ad: Unsupervised adversarial contrastive learning for anomaly detection on multi-modal data in microservice systems

H Liu, X Huang, M Jia, T Jia, J Han… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
To ensure the stability and reliability of microservice systems, timely and accurate anomaly
detection is of utmost importance. Recently, considering the lack of labels in real-world …

Trastrainer: Adaptive sampling for distributed traces with system runtime state

H Huang, X Zhang, P Chen, Z He, Z Chen… - Proceedings of the …, 2024 - dl.acm.org
Distributed tracing has been widely adopted in many microservice systems and plays an
important role in monitoring and analyzing the system. However, trace data often come in …

MADMM: microservice system anomaly detection via multi-modal data and multi-feature extraction

P Wang, X Zhang, Z Cao, Z Chen - Neural Computing and Applications, 2024 - Springer
Accurately detecting anomalies in microservice systems is crucial to avoid system failures
and economic losses for users. Existing approaches detect anomalies by extracting …

Exploring logic scoring of preference for dos attack detection in microservice applications

J Castro, N Laranjeiro, M Vieira - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Microservice architectures allow the development of highly scalable, flexible, and
manageable systems. However, such architectures raise new security problems and …

MLAD: A Unified Model for Multi-system Log Anomaly Detection

R Zang, H Guo, J Yang, J Liu, Z Li, T Zheng… - arxiv preprint arxiv …, 2024 - arxiv.org
In spite of the rapid advancements in unsupervised log anomaly detection techniques, the
current mainstream models still necessitate specific training for individual system datasets …

Unsupervised microservice system anomaly detection via contrastive multi-modal representation clustering

P Wang, X Zhang, Y Chen, Z Cao - Information Processing & Management, 2025 - Elsevier
Anomaly detection in microservice systems is crucial for ensuring system stability and
reliability. Existing methods rely solely on a single type of monitoring data (eg, metrics or …

Generic and robust root cause localization for multi-dimensional data in online service systems

Z Li, J Chen, Y Chen, C Luo, Y Zhao, Y Sun… - Journal of Systems and …, 2023 - Elsevier
Localizing root causes for multi-dimensional data is critical to ensure online service systems'
reliability. When a fault occurs, only the measure values within specific attribute …

MicroOps: Rapid Microservice Data Simulation and AIOps Model Development Platform

Y Li, Z Wang, Q Qi, Y **g, J Wu, Z Wu… - … on Software Analysis …, 2024 - ieeexplore.ieee.org
Artificial Intelligence for IT Operations (AIOps) for microservice systems has attracted much
attention in academia and industry, aiming to reduce the burden of operations developers …

Dependable Microservices in the Kubernetes era: A Practitioners Survey

VJS Souza, VO Neves… - Journal of Internet …, 2024 - journals-sol.sbc.org.br
The microservices architectural style offers several advantages to software development,
including independence among development teams, greater autonomy for developers …