A note on auto-tuning GEMM for GPUs

Y Li, J Dongarra, S Tomov - … : 9th international conference Baton Rouge, la …, 2009 - Springer
The development of high performance dense linear algebra (DLA) critically depends on
highly optimized BLAS, and especially on the matrix multiplication routine (GEMM). This is …

The reliability wall for exascale supercomputing

X Yang, Z Wang, J Xue, Y Zhou - IEEE Transactions on …, 2011 - ieeexplore.ieee.org
Reliability is a key challenge to be understood to turn the vision of exascale supercomputing
into reality. Inevitably, large-scale supercomputing systems, especially those at the …

Simulation System and Method

AK Usadi, N Fedorava, S Terekhov… - US Patent App. 12 …, 2010 - Google Patents
(57) ABSTRACT A method and system are described that enhance the compu tational
simulation, such as a? uid? owing through a porous media, under the present techniques. In …

Multifrontal factorization of sparse SPD matrices on GPUs

T George, V Saxena, A Gupta, A Singh… - … Parallel & Distributed …, 2011 - ieeexplore.ieee.org
Solving large sparse linear systems is often the most computationally intensive component
of many scientific computing applications. In the past, sparse multifrontal direct factorization …

[HTML][HTML] Automated linear solver selection for simulation of multiphysics processes in porous media

Y Zabegaev, E Keilegavlen, E Iversen, I Berre - Computer Methods in …, 2024 - Elsevier
Porous media processes involve various physical phenomena such as mechanical
deformation, transport, and fluid flow. Accurate simulations must capture the strong …

Towards resilient parallel linear Krylov solvers: recover-restart strategies

E Agullo, L Giraud, A Guermouche, J Roman… - 2013 - inria.hal.science
The advent of extreme scale machines will require the use of parallel resources at an
unprecedented scale, probably leading to a high rate of hardware faults. High Performance …

Dynamic load balancing on dedicated heterogeneous systems

I Galindo, F Almeida, JM Badía-Contelles - European Parallel Virtual …, 2008 - Springer
Parallel computing in heterogeneous environments is drawing considerable attention due to
the growing number of these kind of systems. Adapting existing code and libraries to such …

Performance-based numerical solver selection in the Lighthouse framework

E Jessup, P Motter, B Norris, K Sood - SIAM Journal on Scientific Computing, 2016 - SIAM
Scientific and engineering computing rely heavily on linear algebra for large-scale data
analysis, modeling and simulation, machine learning, and other applied problems. Sparse …

Dynamic load balancing on heterogeneous multicore/multiGPU systems

A Acosta, R Corujo, V Blanco… - … Conference on High …, 2010 - ieeexplore.ieee.org
Parallel computing in heterogeneous environments is drawing considerable attention due to
the growing number of these kind of systems. Adapting existing code and libraries to such …

Towards the dynamic load balancing on heterogeneous multi-GPU systems

A Acosta, V Blanco, F Almeida - 2012 IEEE 10th International …, 2012 - ieeexplore.ieee.org
The advent of multicore systems, joined to the potential acceleration of the graphics
processing units, alleviates some well known important architectural problems at the …