[PDF][PDF] Load balancing, fault tolerance, and resource elasticity for asynchronous many-task systems

J Posner - 2021 - kobra.uni-kassel.de
Abstract High-Performance Computing (HPC) enables solving complex problems from
various scientific fields including key societal problems such as COVID-19. Recently …

[CITATION][C] Load Balancing, Fault Tolerance, and Resource Elasticity for Asynchronous Many-Task Systems

C Fohry, M Schulz