A comprehensive perspective on pilot-job systems

M Turilli, M Santcroos, S Jha - ACM Computing Surveys (CSUR), 2018 - dl.acm.org
Pilot-Job systems play an important role in supporting distributed scientific computing. They
are used to execute millions of jobs on several cyberinfrastructures worldwide, consuming …

Design and performance characterization of RADICAL-Pilot on leadership-class platforms

A Merzky, M Turilli, M Titov… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Many extreme scale scientific applications have workloads comprised of a large number of
individual high-performance tasks. The Pilot abstraction decouples workload specification …

The cloud application modelling and execution language

AP Achilleos, K Kritikos, A Rossini… - Journal of Cloud …, 2019 - Springer
Cloud computing offers a flexible pay-as-you-go model for provisioning application
resources, which enables applications to scale on-demand based on the current workload …

P∗: a model of pilot-abstractions

A Luckow, M Santcroos, A Merzky… - 2012 IEEE 8th …, 2012 - ieeexplore.ieee.org
Pilot-Jobs support effective distributed resource utilization, and are arguably one of the most
widely-used distributed computing abstractions, as measured by the number and types of …

GWpilot: Enabling multi-level scheduling in distributed infrastructures with GridWay and pilot jobs

AJ Rubio-Montero, E Huedo, F Castejón… - Future Generation …, 2015 - Elsevier
Current systems based on pilot jobs are not exploiting all the scheduling advantages that the
technique offers, or they lack compatibility or adaptability. To overcome the limitations or …

Contributions to Computing needs in High Energy Physics Offline Activities: Towards an efficient exploitation of heterogeneous, distributed and shared Computing …

A Boyer - 2022 - theses.hal.science
Pushing the boundaries of sciences and providing more advanced services to individuals
and communities continuously demand more sophisticated software, specialized hardware …

JETS: Language and system support for many-parallel-task workflows

JM Wozniak, M Wilde, DS Katz - Journal of grid computing, 2013 - Springer
Many-task computing is a well-established paradigm for implementing loosely coupled
applications (tasks) on large-scale computing systems. However, few of the model's existing …

Reliability evaluation of standby safety systems due to independent and common cause failures

L Lu, G Lewis - 2006 IEEE International Conference on …, 2006 - ieeexplore.ieee.org
Standby redundant systems are often adopted in critical applications such as the emergency
shutdown systems (ESDS) in nuclear power plants (NPPs). One failure mode of the standby …

Abstractions for loosely-coupled and ensemble-based simulations on Azure

A Luckow, S Jha - 2010 IEEE Second International Conference …, 2010 - ieeexplore.ieee.org
Azure is an emerging cloud platform developed and operated by Microsoft. It provides a
range of abstractions and building blocks for creating scalable and reliable scientific …

JETS: Language and System Support for Many-Parallel-Task Computing

JM Wozniak, M Wilde - 2011 40th International Conference on …, 2011 - ieeexplore.ieee.org
Many-task computing is a well-established paradigm for implementing loosely coupled
applications on large-scale computing systems. However, few of the model's existing …