Performance exploration of various C/C++ compilers for AMD EPYC processors in numerical modeling of solidification

K Halbiniak, R Wyrzykowski, L Szustak… - … in Engineering Software, 2022 - Elsevier
The phase-field (PF) method is a powerful tool for solving interfacial problems in materials
science. This paper's primary goal is to assess the impact of various state-of-the-art C/C++ …

Single‐and multi‐GPU computing on NVIDIA‐and AMD‐based server platforms for solidification modeling application

K Halbiniak, N Meyer, K Rojek - Concurrency and Computation …, 2024 - Wiley Online Library
This work explores the performance of single‐and multi‐GPU computing on state‐of‐the‐art
NVIDIA‐and AMD‐based server‐class hardware using various programming interfaces to …

Architectural adaptation and performance-energy optimization for CFD application on AMD EPYC Rome

L Szustak, R Wyrzykowski… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
The advantages of the second-generation AMD EPYC Rome processors can be
successfully used in the race to Exascale. However, the novel architecture's complexity …

Exploration of OpenCL heterogeneous programming for porting solidification modeling to CPU‐GPU platforms

K Halbiniak, L Szustak, T Olas… - Concurrency and …, 2021 - Wiley Online Library
This article provides a comprehensive study of OpenCL heterogeneous programming for
porting applications to CPU–GPU computing platforms, with a real‐life application for the …

Strategy for data-flow synchronizations in stencil parallel computations on multi-/manycore systems

L Szustak - The Journal of Supercomputing, 2018 - Springer
In this paper, an innovative strategy for the data-flow synchronization in shared-memory
systems is proposed. This strategy assumes to synchronize only interdependent threads …

Assessment of offload-based programming environments for hybrid CPU–MIC platforms in numerical modeling of solidification

K Halbiniak, R Wyrzykowski, L Szustak… - … Modelling Practice and …, 2018 - Elsevier
Heterogeneous (or hybrid) computing platforms with Intel Xeon Phi accelerators offer
potential advantages of energy efficient, massively parallel computing, while supporting …

Performance portable parallel programming of heterogeneous stencils across shared-memory platforms with modern Intel processors

L Szustak, P Bratek - The International Journal of High …, 2019 - journals.sagepub.com
In this work, we take up the challenge of performance portable programming of
heterogeneous stencil computations across a wide range of modern shared-memory …

Dynamic workload prediction and distribution in numerical modeling of solidification on multi‐/manycore architectures

K Halbiniak, T Olas, L Szustak… - Concurrency and …, 2021 - Wiley Online Library
This work is a part of the global tendency to use modern computing systems for modeling the
phase‐field phenomena. The main goal of this article is to improve the performance of a …

Using hstreams programming library for accelerating a real-life application on intel MIC

L Szustak, K Halbiniak, A Kulawik… - … and Architectures for …, 2016 - Springer
The main goal of this paper is the suitability assessment of the hStreams programming
library for porting a real-life scientific application to heterogeneous platforms with Intel Xeon …

[PDF][PDF] Exploring OpenMP Accelerator Model in a real-life scientific application using hybrid CPU-MIC platforms

K Halbiniak, L Szustak, A Lastovetsky… - Proceedings 3rd …, 2016 - e-archivo.uc3m.es
The main goal of this paper is the suitability assessment of the OpenMP Accelerator Model
(OMPAM) for porting a real-life scientific application to heterogeneous platforms containing a …