Faults in grids: why are they so bad and what can be done about it?
Computational grids have the potential to become the main execution platform for high
performance and distributed applications. However, such systems are extremely complex …
Online system for grid resource monitoring and machine learning-based prediction
L Hu, XL Che, SQ Zheng - IEEE transactions on parallel and …, 2011 - ieeexplore.ieee.org
Resource allocation and job scheduling are the core functions of grid computing. These
functions are based on adequate information of available resources. Timely acquiring …
Autonomic failover of grid-based services
RP Doyle, DL Kaminsky - US Patent 7,287,179, 2007 - Google Patents
6,922,791 B2* 7/2005 Mashayekhi et al. … 714/4; 6,982,951 B2* 1/2006 Doverspike et al. … 370/217
… failed grid host and platform metrics determined for a proposed …
A resource management and fault tolerance services in grid computing
HM Lee, KS Chung, SH Chin, JH Lee, DW Lee… - Journal of Parallel and …, 2005 - Elsevier
In grid computing, resource management and fault tolerance services are important issues.
The availability of the selected resources for job execution is a primary factor that determines …
An infrastructure for Grid application monitoring
In this paper, we present a concept of the OCM-G, a distributed monitoring system for
obtaining information and manipulating distributed applications running on the Grid. The …
IT governance, risk & compliance (GRC) status quo and integration: an explorative industry case study
The integration of governance, risk, and compliance (GRC) activities has gained importance
over the last years. This paper presents an analysis of the GRC integration efforts in …
A grid monitoring architecture
Large distributed systems such as Computational and Data Grids require that a substantial
amount of monitoring data be collected for various tasks such as fault detection …
An approach to grid resource selection and fault management based on ECA rules
In grid computing, resource management and fault tolerance services are important issues.
Because the numbers of the application tasks and amounts of required resources are …
A fault tolerance service for QoS in grid computing
HM Lee, K Sik Chung, SH Jin, DW Lee, WG Lee… - … Science—ICCS 2003 …, 2003 - Springer
This paper proposes fault tolerance service to satisfy QoS requirement in grid computing.
The probability of failure in grid computing is higher than in traditional parallel …
Resource scheduling in desktop grid by grid-JQA
In desktop grid computing, resource scheduling is an important issue. In this paper, we
propose a QoS-based resource scheduling algorithm that finds the best match between …