A review on regional convection‐permitting climate modeling: Demonstrations, prospects, and challenges

AF Prein, W Langhans, G Fosser… - Reviews of …, 2015 - Wiley Online Library
Regional climate modeling using convection‐permitting models (CPMs; horizontal grid
spacing < 4 km) emerges as a promising framework to provide more reliable climate …

I/O access patterns in HPC applications: A 360-degree survey

JL Bez, S Byna, S Ibrahim - ACM Computing Surveys, 2023 - dl.acm.org
The high-performance computing I/O stack has been complex due to multiple software
layers, the inter-dependencies among these layers, and the different performance tuning …

A novel parametrization of the perspective-three-point problem for a direct computation of absolute camera position and orientation

L Kneip, D Scaramuzza, R Siegwart - CVPR 2011, 2011 - ieeexplore.ieee.org
The Perspective-Three-Point (P3P) problem aims at determining the position and orientation
of the camera in the world reference frame from three 2D-3D point correspondences. This …

Score-P: A joint performance measurement run-time infrastructure for Periscope, Scalasca, TAU, and Vampir

A Knüpfer, C Rössel, D Mey, S Biersdorff… - Tools for High …, 2012 - Springer
This paper gives an overview about the Score-P performance measurement infrastructure
which is being jointly developed by leading HPC performance tools groups. It motivates the …

Caliper: performance introspection for HPC software stacks

D Boehme, T Gamblin, D Beckingsale… - SC'16: Proceedings …, 2016 - ieeexplore.ieee.org
Many performance engineering tasks, from long-term performance monitoring to
post-mortem analysis and online tuning, require efficient runtime methods for introspection and …

Anatomically accurate high resolution modeling of human whole heart electromechanics: a strongly scalable algebraic multigrid solver method for nonlinear …

CM Augustin, A Neic, M Liebmann, AJ Prassl… - Journal of computational …, 2016 - Elsevier
Electromechanical (EM) models of the heart have been used successfully to study
fundamental mechanisms underlying a heart beat in health and disease. However, in all …

Petascale high order dynamic rupture earthquake simulations on heterogeneous supercomputers

A Heinecke, A Breuer, S Rettenberger… - SC'14: Proceedings …, 2014 - ieeexplore.ieee.org
We present an end-to-end optimization of the innovative Arbitrary high-order DERivative
Discontinuous Galerkin (ADER-DG) software SeisSol targeting Intel® Xeon Phi coprocessor …

Using automated performance modeling to find scalability bugs in complex codes

A Calotoiu, T Hoefler, M Poke, F Wolf - Proceedings of the International …, 2013 - dl.acm.org
Many parallel applications suffer from latent performance limitations that may prevent them
from scaling to larger machine sizes. Often, such scalability bugs manifest themselves only …

Trends in data locality abstractions for HPC systems

D Unat, A Dubey, T Hoefler, J Shalf… - … on Parallel and …, 2017 - ieeexplore.ieee.org
The cost of data movement has always been an important concern in high performance
computing (HPC) systems. It has now become the dominant factor in terms of both energy …

Open trace format 2: The next generation of scalable trace formats and support libraries

D Eschweiler, M Wagner, M Geimer… - … and Techniques on …, 2012 - ebooks.iospress.nl
A well designed event trace data format is the basis of all trace-based analysis methods. In
this paper, we introduce the Open Trace Format Version 2 (OTF2). It is a major re-design …