Parallel programming models for heterogeneous many-cores: a comprehensive survey

J Fang, C Huang, T Tang, Z Wang - CCF Transactions on High …, 2020 - Springer
Heterogeneous many-cores are now an integral part of modern computing systems ranging
from embedding systems to supercomputers. While heterogeneous many-core design offers …

Harnessing billions of tasks for a scalable portable hydrodynamic simulation of the merger of two stars

T Heller, BA Lelbach, KA Huck… - … Journal of High …, 2019 - journals.sagepub.com
We present a highly scalable demonstration of a portable asynchronous many-task
programming model and runtime system applied to a grid-based adaptive mesh refinement …

From task-based gpu work aggregation to stellar mergers: Turning fine-grained cpu tasks into portable gpu kernels

G Daiß, P Diehl, D Marcello… - 2022 IEEE/ACM …, 2022 - ieeexplore.ieee.org
Meeting both scalability and performance portability requirements is a challenge for any
HPC application, especially for adaptively refined ones. In Octo-Tiger, an astrophysics …

Vectorization of high-performance scientific calculations using avx-512 intruction set

BM Shabanov, AA Rybakov, SS Shumilin - Lobachevskii Journal of …, 2019 - Springer
Modern calculation codes used in supercomputing are very demanding of computing
resources. For their effective appliance requires the use of parallelization at all levels …

From merging frameworks to merging stars: Experiences using hpx, kokkos and simd types

G Daiß, SY Singanaboina, P Diehl… - 2022 IEEE/ACM 7th …, 2022 - ieeexplore.ieee.org
Octo-Tiger, a large-scale 3D AMR code for the merger of stars, uses a combination of HPX,
Kokkos and explicit SIMD types, aiming to achieve performance-portability for a broad range …