Performance-aware management of cloud resources: A taxonomy and future directions

SK Moghaddam, R Buyya… - ACM Computing Surveys …, 2019 - dl.acm.org
The dynamic nature of the cloud environment has made the distributed resource
management process a challenge for cloud service providers. The importance of …

Online diagnosis of performance variation in HPC systems using machine learning

O Tuncer, E Ates, Y Zhang, A Turk… - … on Parallel and …, 2018 - ieeexplore.ieee.org
As the size and complexity of high performance computing (HPC) systems grow in line with
advancements in hardware and software technology, HPC systems increasingly suffer from …

Clouddet: Interactive visual analysis of anomalous performances in cloud computing systems

K Xu, Y Wang, L Yang, Y Wang, B Qiao… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
Detecting and analyzing potential anomalous performances in cloud computing systems is
essential for avoiding losses to customers and ensuring the efficient operation of the …

Anomaly Detection in Microservice-Based Systems

J Nobre, EJS Pires, A Reis - Applied Sciences, 2023 - mdpi.com
Currently, distributed software systems have evolved at an unprecedented pace. Modern
software-quality requirements are high and require significant staff support and effort. This …

Predicting failures in multi-tier distributed systems

L Mariani, M Pezzè, O Riganelli, R **n - Journal of Systems and Software, 2020 - Elsevier
Many applications are implemented as multi-tier software systems, and are executed on
distributed infrastructures, like cloud infrastructures, to benefit from the cost reduction that …

On the effectiveness of isolation‐based anomaly detection in cloud data centers

RN Calheiros, K Ramamohanarao… - Concurrency and …, 2017 - Wiley Online Library
The high volume of monitoring information generated by large‐scale cloud infrastructures
poses a challenge to the capacity of cloud providers in detecting anomalies in the …

A new weighted fuzzy C-means clustering for workload monitoring in cloud datacenter platforms

S El Motaki, A Yahyaouy, H Gualous, J Sabor - Cluster Computing, 2021 - Springer
The rapid growth in virtualization solutions has driven the widespread adoption of cloud
computing paradigms among various industries and applications. This has led to a growing …

E2EWatch: an end-to-end anomaly diagnosis framework for production HPC systems

B Aksar, B Schwaller, O Aaziz, VJ Leung… - Euro-Par 2021: Parallel …, 2021 - Springer
Abstract In today's High-Performance Computing (HPC) systems, application performance
variations are among the most vital challenges as they adversely affect system efficiency …

Analysis of load balancing detection methods using hidden markov model for secured cloud computing environment

M Arvindhan, D Rajesh Kumar - Applications of Computational Methods in …, 2022 - Springer
Cloud computing is rapidly growing nowadays and stands a creative computing paradigm
which provides Internet facilities to meet users' storage needs. Various interconnected …

Anthropomorphic diagnosis of runtime hidden behaviors in OpenMP multi-threaded applications

W Wang, D Li, W Luo, Y Kang, L Wang - Journal of Parallel and Distributed …, 2023 - Elsevier
Extreme-scale computing involves hundreds of millions of threads with multi-level
parallelism running on large-scale hierarchical and heterogeneous hardware. Some …