Scalable distributed high-order stencil computations

M Jacquelin, M Araya–Polo… - … Conference for High …, 2022 - ieeexplore.ieee.org
Stencil computations lie at the heart of many scientific and industrial applications. Stencil
algorithms pose several challenges on machines with cache based memory hierarchy, due …

A pattern-based comparison of OpenACC and OpenMP for accelerator computing

S Wienke, C Terboven, JC Beyer, MS Müller - Euro-Par 2014 Parallel …, 2014 - Springer
Nowadays, HPC systems frequently emerge as clusters of commodity processors with
attached accelerators. Moving from tedious low-level accelerator programming to increased …

Minimod: A finite difference solver for seismic modeling

J Meng, A Atle, H Calandra, M Araya-Polo - arxiv preprint arxiv …, 2020 - arxiv.org
This article introduces a benchmark application for seismic modeling using finite difference
method, which is namedMiniMod, a mini application for seismic modeling. The purpose is to …

Performance portability in reverse time migration and seismic modelling via OpenACC

A Qawasmeh, MR Hugues… - … Journal of High …, 2017 - journals.sagepub.com
Heterogeneity among the computational resources within a single machine has significantly
increased in high performance computing to exploit the tremendous potential of graphics …

Exploring parallel programming models for heterogeneous computing systems

M Daga, ZS Tschirhart, C Freitag - 2015 IEEE international …, 2015 - ieeexplore.ieee.org
Parallel systems that employ CPUs and GPUs as two heterogeneous computational units
have become immensely popular due to their ability to maximize performance under …

Evaluating performance of OpenMP tasks in a seismic stencil application

E Raut, J Meng, M Araya-Polo, B Chapman - OpenMP: Portable Multi …, 2020 - Springer
Simulations based on stencil computations (widely used in geosciences) have been
dominated by the MPI+ OpenMP programming model paradigm. Little effort has been …

Porting and evaluation of a distributed task-driven stencil-based application

E Raut, J Anderson, M Araya-Polo, J Meng - Proceedings of the 12th …, 2021 - dl.acm.org
Alternative programming models and runtimes are increasing in popularity and maturity.
This allows porting and comparing, on competitive grounds, emerging parallel approaches …

JACC: an openacc runtime framework with kernel-level and multi-gpu parallelization

K Matsumura, SG De Gonzalo… - 2021 IEEE 28th …, 2021 - ieeexplore.ieee.org
The rapid development in computing technology has paved the way for directive-based
programming models towards a principal role in maintaining software portability of …

Parallel computation of a dam-break flow model using OpenACC applications

S Zhang, R Yuan, Y Wu, Y Yi - Journal of hydraulic engineering, 2017 - ascelibrary.org
Two key factors in dam-break modeling are accuracy and speed. Therefore, high-
performance calculations are of great importance to the simulation of dam-break events. In …

Massively scalable stencil algorithm

M Jacquelin, M Araya-Polo, J Meng - arxiv preprint arxiv:2204.03775, 2022 - arxiv.org
Stencil computations lie at the heart of many scientific and industrial applications.
Unfortunately, stencil algorithms perform poorly on machines with cache based memory …