Performance exploration of various C/C++ compilers for AMD EPYC processors in numerical modeling of solidification
The phase-field (PF) method is a powerful tool for solving interfacial problems in materials
science. This paper's primary goal is to assess the impact of various state-of-the-art C/C++ …
Single‐ and multi‐GPU computing on NVIDIA‐ and AMD‐based server platforms for solidification modeling application
This work explores the performance of single‐ and multi‐GPU computing on state‐of‐the‐art
NVIDIA‐ and AMD‐based server‐class hardware using various programming interfaces to …
Architectural adaptation and performance-energy optimization for CFD application on AMD EPYC Rome
L Szustak, R Wyrzykowski… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
The advantages of the second-generation AMD EPYC Rome processors can be
successfully used in the race to Exascale. However, the novel architecture's complexity …
Exploration of OpenCL heterogeneous programming for porting solidification modeling to CPU‐GPU platforms
This article provides a comprehensive study of OpenCL heterogeneous programming for
porting applications to CPU–GPU computing platforms, with a real‐life application for the …
Strategy for data-flow synchronizations in stencil parallel computations on multi-/manycore systems
L Szustak - The Journal of Supercomputing, 2018 - Springer
In this paper, an innovative strategy for data-flow synchronization in shared-memory
systems is proposed. This strategy synchronizes only interdependent threads …
Assessment of offload-based programming environments for hybrid CPU–MIC platforms in numerical modeling of solidification
Heterogeneous (or hybrid) computing platforms with Intel Xeon Phi accelerators offer
potential advantages of energy efficient, massively parallel computing, while supporting …
Performance portable parallel programming of heterogeneous stencils across shared-memory platforms with modern Intel processors
In this work, we take up the challenge of performance portable programming of
heterogeneous stencil computations across a wide range of modern shared-memory …
Dynamic workload prediction and distribution in numerical modeling of solidification on multi‐/manycore architectures
This work is a part of the global tendency to use modern computing systems for modeling the
phase‐field phenomena. The main goal of this article is to improve the performance of a …
Using hStreams programming library for accelerating a real-life application on Intel MIC
The main goal of this paper is the suitability assessment of the hStreams programming
library for porting a real-life scientific application to heterogeneous platforms with Intel Xeon …
Exploring OpenMP Accelerator Model in a real-life scientific application using hybrid CPU-MIC platforms
The main goal of this paper is the suitability assessment of the OpenMP Accelerator Model
(OMPAM) for porting a real-life scientific application to heterogeneous platforms containing a …