A survey on checkpointing strategies: Should we always checkpoint à la Young/Daly?
The Young/Daly formula provides an approximation of the optimal checkpointing
period for a parallel application executing on a supercomputing platform. It was originally …
SLATE: Design of a modern distributed and accelerated linear algebra library
The SLATE (Software for Linear Algebra Targeting Exascale) library is being developed to
provide fundamental dense linear algebra capabilities for current and upcoming distributed …
Accelerating geostatistical modeling and prediction with mixed-precision computations: A high-productivity approach with PaRSEC
Geostatistical modeling, one of the prime motivating applications for exascale computing, is
a technique for predicting desired quantities from geographically distributed data, based on …
Extreme-scale task-based Cholesky factorization toward climate and weather prediction applications
Climate and weather can be predicted statistically via geospatial Maximum Likelihood
Estimates (MLE), as an alternative to running large ensembles of forward models. The MLE …
Celerity: High-level C++ for accelerator clusters
In the face of ever-slowing single-thread performance growth for CPUs, the scientific and
engineering communities increasingly turn to accelerator parallelization to tackle growing …
Evolution of the SLATE linear algebra library
SLATE (Software for Linear Algebra Targeting Exascale) is a distributed, dense linear
algebra library targeting both CPU-only and GPU-accelerated systems, developed over the …
Towards extreme scale technologies and accelerators for EuroHPC HW/SW supercomputing applications for exascale: the TEXTAROSSA approach
In the near future, Exascale systems will need to bridge three technology gaps to achieve
high performance while remaining under tight power constraints: energy efficiency and …
Scaling implicit parallelism via dynamic control replication
We present dynamic control replication, a run-time program analysis that enables scalable
execution of implicitly parallel programs on large machines through a distributed and …
A 2D hydrodynamic model for shallow water flows with significant infiltration losses
Y Ni, Z Cao, Q Liu, Q Liu - Hydrological Processes, 2020 - Wiley Online Library
Infiltration losses may be significant and warrant proper incorporation into mathematical
models for river floods in arid and semi‐arid areas, rainfall‐induced surface runoffs in …
Exploiting data sparsity for large-scale matrix computations
Exploiting data sparsity in dense matrices is an algorithmic bridge between architectures
that are increasingly memory-austere on a per-core basis and extreme-scale applications. In …