Analyzing and predicting job failures from HPC system log

JW Park, X Huang, CH Lee - The Journal of Supercomputing, 2024 - Springer
In this paper, we analyze the scheduler log of a production supercomputer that contains
complete job information, which is in contrast to many existing (publicly available) HPC logs …

A survey on scheduling heuristics in grid computing environment

MK Mishra, YS Patel, Y Rout… - International Journal of …, 2014 - search.proquest.com
Job scheduling is one of the thrust research area in the discipline of Grid computing.
Scheduling in the Grid environment is not only complicated but also known to be NP …

Economic issues in shared infrastructures

C Courcoubetis, RR Weber - Proceedings of the 1st ACM workshop on …, 2009 - dl.acm.org
We define some interesting incentive issues that arise in the management of virtual
infrastructures. We demonstrate that participants' decisions about the quantities of …

Queue waiting time prediction for large-scale high-performance computing system

JW Park - 2019 International Conference on High Performance …, 2019 - ieeexplore.ieee.org
Traditionally, high-performance computing (HPC) systems have been extensively utilized in
many science fields including big data analysis and machine learning. Such large-scale …

From volunteer to trustable computing: Providing QoS-aware scheduling mechanisms for multi-grid computing environments

J Conejero, B Caminero, C Carrión, L Tomás - Future Generation …, 2014 - Elsevier
The exploitation of service oriented technologies, such as Grid computing, is being boosted
by the current service oriented economy trend, leading to a growing need of Quality of …

Providing grid services based on virtualization and cloud technologies

JL Cacheiro, C Fernández, E Freire, S Díaz… - Euro-Par 2009–Parallel …, 2010 - Springer
CESGA is operating a totally virtualized grid infrastructure that supports several production
sites for different grid projects (EGEE, EELA, int. eu. grid (I2G), Ibergrid, and other regional …

Divide-and-conquer strategies for large-scale simulations in R

H Zhang, Y Zhong, J Lin - … Conference on Big Data (Big Data), 2017 - ieeexplore.ieee.org
As the volume of data and technical complexity of large-scale analysis increases, many
domain experts desire a computational powerful but still familiar analysis interface to fully …

Using Virtualization Approaches to Solve Deep Learning Problems in Voluntary Distributed Computing Projects

I Kurochkin, V Papanov - Russian Supercomputing Days, 2023 - Springer
The task of training deep neural networks on a large amount of data requires a lot of
resources. The solution of such a problem is often impossible to carry out on one computing …

[PDF][PDF] Grid Scheduling Optimization Based on Resource Characteristics

A Aggarwal, P Du, D Robert - Journal of Computational …, 2010 - researchgate.net
Scheduling is an active research area in the Computational Grid environment. The objective
of grid scheduling is both to deliver the Quality of Service (QoS) requirements of the grid …

[PDF][PDF] Evaluating grid infrastructure for natural language processing

R Garabík, JJ Javoršek… - GCCP 2010 BOOK OF …, 2010 - conference.ui.sav.sk
Increasing computing requirements for acquiring and processing large data-sets and
working with big corpora in Natural Language Processing (NLP) and related disciplines …