Performance anomaly detection and bottleneck identification

O Ibidunmoye, F Hernández-Rodriguez… - ACM Computing Surveys …, 2015 - dl.acm.org
In order to meet stringent performance requirements, system administrators must effectively
detect undesirable performance behaviours, identify potential root causes, and take …

Hitanomaly: Hierarchical transformers for anomaly detection in system log

S Huang, Y Liu, C Fung, R He, Y Zhao… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
Enterprise systems often produce a large volume of logs to record runtime status and events.
Anomaly detection from system logs is crucial for service management and system …

Studying the effectiveness of application performance management (apm) tools for detecting performance regressions for web applications: an experience report

TM Ahmed, CP Bezemer, TH Chen… - Proceedings of the 13th …, 2016 - dl.acm.org
Performance regressions, such as a higher CPU utilization than in the previous version of an
application, are caused by software application updates that negatively affect the …

Bert-log: Anomaly detection for system logs based on pre-trained language model

S Chen, H Liao - Applied Artificial Intelligence, 2022 - Taylor & Francis
Logs are primary information resource for fault diagnosis and anomaly detection in large-
scale computer systems, but it is hard to classify anomalies from system logs. Recent studies …

Root cause detection in a service-oriented architecture

M Kim, R Sumbaly, S Shah - ACM SIGMETRICS Performance Evaluation …, 2013 - dl.acm.org
Large-scale websites are predominantly built as a service-oriented architecture. Here,
services are specialized for a certain task, run on multiple machines, and communicate with …

Prepare: Predictive performance anomaly prevention for virtualized cloud systems

Y Tan, H Nguyen, Z Shen, X Gu… - 2012 IEEE 32nd …, 2012 - ieeexplore.ieee.org
Virtualized cloud systems are prone to performance anomalies due to various reasons such
as resource contentions, software bugs, and hardware failures. In this paper, we present a …

Performance-aware management of cloud resources: A taxonomy and future directions

SK Moghaddam, R Buyya… - ACM Computing Surveys …, 2019 - dl.acm.org
The dynamic nature of the cloud environment has made the distributed resource
management process a challenge for cloud service providers. The importance of …

Speech dereverberation via maximum-kurtosis subband adaptive filtering

BW Gillespie, HS Malvar… - 2001 IEEE International …, 2001 - ieeexplore.ieee.org
This paper presents an efficient algorithm for high-quality speech capture in applications
such as hands-free teleconferencing or voice recording by personal computers. We process …

Causeinfer: Automatic and distributed performance diagnosis with hierarchical causality graph in large distributed systems

P Chen, Y Qi, P Zheng, D Hou - IEEE INFOCOM 2014-IEEE …, 2014 - ieeexplore.ieee.org
Modern applications especially cloud-based or cloud-centric applications always have many
components running in the large distributed environment with complex interactions. They …

Workflow-aware automatic fault diagnosis for microservice-based applications with statistics

T Wang, W Zhang, J Xu, Z Gu - IEEE Transactions on Network …, 2020 - ieeexplore.ieee.org
Microservice architectures bring many benefits, eg, faster delivery, improved scalability, and
greater autonomy, so they are widely adopted to develop and operate Internet-based …