Safe inspection of live virtual machines

S Suneja, R Koller, C Isci, E De Lara, A Hashemi… - ACM SIGPLAN …, 2017 - dl.acm.org
With DevOps automation and an everything-as-code approach to lifecycle management for
cloud-native applications, challenges emerge from an operational visibility and control …

Piccolo: A fast and efficient rollback system for virtual machine clusters

L Cui, Z Hao, Y Peng, X Yun - IEEE Transactions on Parallel …, 2017 - ieeexplore.ieee.org
Rollback is an effective technique to resume the system execution from a recorded
intermediate state upon failures, without having to restart the entire system. However, in …

eHotSnap: an efficient and hot distributed snapshots system for virtual machine cluster

B Li, L Cui, Z Hao, L Li, Y Liu, Y Li - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the popularity of IaaS clouds, many distributed and networked applications are running
in virtual machine cluster (VMC). The distributed snapshots of VMC are a practical approach …

A transparent hypervisor-level checkpoint-restart mechanism for a cluster of virtual machines

C Pechwises, K Chanchio - 2018 15th International Joint …, 2018 - ieeexplore.ieee.org
A cluster of virtual machines is a common platform for running MPI applications in cloud
computing environments. However, most traditional methods to provide fault tolerance to …

iConSnap: An incremental continuous snapshots system for virtual machines

Z Hao, W Wang, L Cui, X Yun… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
The reliability of data and services hosted on a virtual machine (VM) is a top concern in
cloud environments. The Continuous Snapshots can reduce the data loss in case of failures …

An In-Memory Checkpoint-Restart mechanism for a cluster of virtual machines

J Yaothanee, K Chanchio - 2019 16th International Joint …, 2019 - ieeexplore.ieee.org
A cluster of virtual machines can be used to execute parallel applications in Cloud
Computing environments. However, the cloud infrastructure may fail at any time for a variety …

A checkpointing mechanism for virtual clusters using memory-bound time-multiplexed data transfers

J Yaothanee, K Chanchio - International Journal of Electrical and …, 2024 - academia.edu
Transparent hypervisor-level checkpoint-restart mechanisms for virtual clusters (VCs) or
clusters of virtual machines (VMs) offer an attractive fault tolerance capability for cloud data …

A distributed snapshot protocol for efficient artificial intelligence computation in cloud computing environments

JB Lim, JM Gil, HC Yu - Symmetry, 2018 - mdpi.com
Many artificial intelligence applications often require a huge amount of computing resources.
As a result, cloud computing adoption rates are increasing in the artificial intelligence field …