[PDF][PDF] Taking GPU Programming Models to Task for Performance Portability
Analyzing the Performance Portability of SYCL across CPUs, GPUs, and Hybrid Systems with Protein Database Search
The high-performance computing (HPC) landscape is undergoing rapid transformation, with
an increasing emphasis on energy-efficient and heterogeneous computing environments …
an increasing emphasis on energy-efficient and heterogeneous computing environments …
On the Inorrect Use of Application Efficiency to Calculate Performance Portability
A Marowka - arxiv preprint arxiv:2407.00232, 2024 - arxiv.org
The emergence of heterogeneity in high-performance computing, which harnesses under
one integrated system several platforms of different architectures, also led to the …
one integrated system several platforms of different architectures, also led to the …
Evaluation of computational and energy performance in matrix multiplication algorithms on CPU and GPU using MKL, cuBLAS and SYCL
LA Torres, Y Denneulin - arxiv preprint arxiv:2405.17322, 2024 - arxiv.org
Matrix multiplication is fundamental in the backpropagation algorithm used to train deep
neural network models. Libraries like Intel's MKL or NVIDIA's cuBLAS implemented new and …
neural network models. Libraries like Intel's MKL or NVIDIA's cuBLAS implemented new and …
Ponte Vecchio Across the Atlantic: Single-Node Benchmarking of Two Intel GPU Systems
Intel Data Center GPU Max 1550, known as Ponte Vecchio (PVC), is a new Intel GPU
architecture for high-performance computing. It is the basis of two systems on the June 2024 …
architecture for high-performance computing. It is the basis of two systems on the June 2024 …
Development of performance portable spline solver for exa-scale plasma turbulence simulation
This paper describes the development of performance portable spline building kernels on
top of Kokkos-kernels. We wish to solve a single matrix equation with multiple right-hand …
top of Kokkos-kernels. We wish to solve a single matrix equation with multiple right-hand …
GenVectorX: A performance-portable SYCL library for Lorentz Vectors operations
The Large Hadron Collider (LHC) at CERN will see an upgraded hardware configuration
which will bring a new era of physics data taking and related computational challenges. To …
which will bring a new era of physics data taking and related computational challenges. To …
Unlocking performance portability on LUMI-G supercomputer: A virtual screening case study
High-Performance Computing is the target system for virtual screening applications, which
aim to suggest which candidates to test in the drug discovery process. The HPC …
aim to suggest which candidates to test in the drug discovery process. The HPC …
Experiences with implementing Kokkos' SYCL backend
With the recent diversification of the hardware landscape in the high-performance computing
community, performance-portability solutions are becoming more and more important. One …
community, performance-portability solutions are becoming more and more important. One …
[PDF][PDF] Portability Efficiency Approach for Calculating Performance Portability
A Marowka - researchgate.net
The emergence of heterogeneity in high-performance computing, which harnesses under
one integrated system several platforms of different architectures, also led to the …
one integrated system several platforms of different architectures, also led to the …