Energy efficient fault tolerance techniques in green cloud computing: A systematic survey and taxonomy

S Bharany, S Badotra, S Sharma, S Rani… - Sustainable Energy …, 2022 - Elsevier
Cloud computing has brought the accessibility of several software platforms under a single
roof. It has transformed resources into scalable services on demand and provides the only …

Towards Resilient Method: An exhaustive survey of fault tolerance methods in the cloud computing environment

MA Shahid, N Islam, MM Alam, MS Mazliham… - Computer Science …, 2021 - Elsevier
Fault Tolerance (FT) is one of the cloud's very critical problems for providing security
assistance. Due to the diverse service architecture, detailed architectures & multiple …

RUAD: Unsupervised anomaly detection in HPC systems

M Molan, A Borghesi, D Cesarini, L Benini… - Future Generation …, 2023 - Elsevier
The increasing complexity of modern high-performance computing (HPC) systems
necessitates the introduction of automated and data-driven methodologies to support system …

M100 exadata: a data collection campaign on the cineca's marconi100 tier-0 supercomputer

A Borghesi, C Di Santi, M Molan, MS Ardebili, A Mauri… - Scientific Data, 2023 - nature.com
Supercomputers are the most powerful computing machines available to society. They play
a central role in economic, industrial, and societal development. While they are used by …

Achieving reliability in cloud computing by a novel hybrid approach

MA Shahid, MM Alam, MM Su'ud - Sensors, 2023 - mdpi.com
Cloud computing (CC) benefits and opportunities are among the fastest growing
technologies in the computer industry. Cloud computing's challenges include resource …

Anomaly detection and anticipation in high performance computing systems

A Borghesi, M Molan, M Milano… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In their quest toward Exascale, High Performance Computing (HPC) systems are rapidly
becoming larger and more complex, together with the issues concerning their maintenance …

Examon-x: a predictive maintenance framework for automatic monitoring in industrial iot systems

A Borghesi, A Burrello, A Bartolini - IEEE Internet of Things …, 2021 - ieeexplore.ieee.org
In recent years, the Industrial Internet of Things (IIoT) has led to significant steps forward in
many industries, thanks to the exploitation of several technologies, ranging from Big Data …

Fault detection and control in integrated energy system using machine learning

P Wang, P Poovendran, KB Manokaran - Sustainable Energy Technologies …, 2021 - Elsevier
Abstract Integrated Energy System (IES), which covers electricity/gas/heat and other different
energy sources, is an integral source of energy and Fault Detection in dynamic processing …

Improved accuracy and less fault prediction errors via modified sequential minimal optimization algorithm

M Asim Shahid, MM Alam, M Mohd Su'ud - Plos one, 2023 - journals.plos.org
The benefits and opportunities offered by cloud computing are among the fastest-growing
technologies in the computer industry. Additionally, it addresses the difficulties and issues …

Adaptive feature selection for predicting application performance degradation in edge cloud environments

B Shayesteh, C Fu, A Ebrahimzadeh… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Applications deployed in edge cloud environments can have stringent requirements such as
high throughput and high availability. However, these applications may suffer from …