Unsupervised detection of microservice trace anomalies through service-level deep Bayesian networks

P Liu, H Xu, Q Ouyang, R Jiao, Z Chen… - 2020 IEEE 31st …, 2020 - ieeexplore.ieee.org
The anomalies of microservice invocation traces (traces) often indicate that the quality of the
microservice-based large software service is being impaired. However, timely and …

Opprentice: Towards practical and automatic anomaly detection through machine learning

D Liu, Y Zhao, H Xu, Y Sun, D Pei, J Luo… - Proceedings of the …, 2015 - dl.acm.org
Closely monitoring service performance and detecting anomalies are critical for
Internet-based services. However, even though dozens of anomaly detectors have been proposed …

Identifying bad software changes via multimodal anomaly detection for online service systems

N Zhao, J Chen, Z Yu, H Wang, J Li, B Qiu… - Proceedings of the 29th …, 2021 - dl.acm.org
In large-scale online service systems, software changes are inevitable and frequent. Due to
importing new code or configurations, changes are likely to incur incidents and destroy user …

Robust and rapid adaption for concept drift in software system anomaly detection

M Ma, S Zhang, D Pei, X Huang… - 2018 IEEE 29th …, 2018 - ieeexplore.ieee.org
Anomaly detection is critical for web-based software systems. Anecdotal evidence suggests
that in these systems, the accuracy of a static anomaly detection method that was previously …

Syslog processing for switch failure diagnosis and prediction in datacenter networks

S Zhang, W Meng, J Bu, S Yang, Y Liu… - 2017 IEEE/ACM 25th …, 2017 - ieeexplore.ieee.org
Syslogs on switches are a rich source of information for both post-mortem diagnosis and
proactive prediction of switch failures in a datacenter network. However, such information …

Interpretable Failure Localization for Microservice Systems Based on Graph Autoencoder

Y Sun, Z Lin, B Shi, S Zhang, S Ma, P **… - ACM Transactions on …, 2024 - dl.acm.org
Accurate and efficient localization of root cause instances in large-scale microservice
systems is of paramount importance. Unfortunately, prevailing methods face several …

Spatio-temporal factorization of log data for understanding network events

T Kimura, K Ishibashi, T Mori, H Sawada… - … -IEEE Conference on …, 2014 - ieeexplore.ieee.org
Understanding the impacts and patterns of network events such as link flaps or hardware
errors is crucial for diagnosing network anomalies. In large production networks, analyzing …

Auric: using data-driven recommendation to automatically generate cellular configuration

A Mahimkar, A Sivakumar, Z Ge, S Pathak… - Proceedings of the 2021 …, 2021 - dl.acm.org
Cellular service providers add carriers in the network in order to support the increasing
demand in voice and data traffic and provide good quality of service to the users. Addition of …

Measurement and analysis on the packet delivery performance in a large-scale sensor network

W Dong, Y Liu, Y He, T Zhu… - IEEE/ACM Transactions …, 2013 - ieeexplore.ieee.org
Understanding the packet delivery performance of a wireless sensor network (WSN) is
critical for improving system performance and exploring future developments and …

Robust network compressive sensing

YC Chen, L Qiu, Y Zhang, G Xue, Z Hu - Proceedings of the 20th annual …, 2014 - dl.acm.org
Networks are constantly generating an enormous amount of rich diverse information. Such
information creates exciting opportunities for network analytics. However, a major challenge …