A fault tolerant elastic resource management framework toward high availability of cloud services

D Saxena, I Gupta, AK Singh… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Cloud computing has become inevitable for every digital service which has exponentially
increased its usage. However, a tremendous surge in cloud resource demand stave off …

A systematic survey on fault-tolerant solutions for distributed data analytics: Taxonomy, comparison, and future directions

S Isukapalli, SN Srirama - Computer Science Review, 2024 - Elsevier
Fault tolerance is becoming increasingly important for upcoming exascale systems,
supporting distributed data processing, due to the expected decrease in the Mean Time …

A high availability management model based on VM significance ranking and resource estimation for cloud applications

D Saxena, AK Singh - IEEE Transactions on Services …, 2022 - ieeexplore.ieee.org
Massive upsurge in cloud resource usage stave off service availability resulting into
outages, resource contention, and excessive power-consumption. The existing approaches …

Cloud failure prediction based on traditional machine learning and deep learning

TN Tengku Asmawi, A Ismail, J Shen - Journal of Cloud Computing, 2022 - Springer
Cloud failure is one of the critical issues since it can cost millions of dollars to cloud service
providers, in addition to the loss of productivity suffered by industrial users. Fault tolerance …

Prioritized fault recovery strategies for multi-access edge computing using probabilistic model checking

K Ray, A Banerjee - IEEE Transactions on Dependable and …, 2022 - ieeexplore.ieee.org
The advent of Multi-Access Edge Computing (MEC) has enabled service providers to
mitigate high network latencies often encountered in accessing cloud services by deploying …

A taxonomy of security and defense mechanisms in digital twins-based cyber-physical systems

A Hussaini, C Qian, W Liao, W Yu - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
The (IoT) paradigm's fundamental goal is to massively connect the “smart things” through
standardized interfaces, providing a variety of smart services. Cyber-Physical Systems …

[PDF][PDF] Enhancement of Cloud Computing Environment Using Machine Learning Algorithms MLCE

ME Seno, BN Dhannoon, OKJ Mohammad - Iraqi Journal of Computers …, 2023 - iasj.net
Cloud computing is an evolving and high-demand research field at the forefront of
technological advancements. It aims to provide software resources and operates based on …

A combined system metrics approach to cloud service reliability using artificial intelligence

TR Chhetri, CK Dehury, A Lind, SN Srirama… - Big Data and Cognitive …, 2022 - mdpi.com
Identifying and anticipating potential failures in the cloud is an effective method for
increasing cloud reliability and proactive failure management. Many studies have been …

Autonomic rejuvenation of cloud applications as a countermeasure to software anomalies

P Di Sanzo, DR Avresky… - Software: Practice and …, 2021 - Wiley Online Library
Failures in computer systems can be often tracked down to software anomalies of various
kinds. In many scenarios, it might be difficult, unfeasible, or unprofitable to carry out …

Fault tree analysis based virtual machine migration for fault-tolerant cloud data center

GJ Leelipushpam, IJ Jebadurai… - Journal of integrated …, 2021 - journals.sagepub.com
Though cloud data center is highly adapted for flexible, scalable and highly available
computing and storage resources, it is vulnerable to failures. Predicting the occurrence of …