Evaluating the performance of HPC-style SYCL applications
SYCL is a parallel programming model for develo** single-source programs for running
on heterogeneous platforms. To this end, it allows for one code to be written which can run …
on heterogeneous platforms. To this end, it allows for one code to be written which can run …
Porting a legacy cuda stencil code to oneapi
S Christgau, T Steinke - 2020 IEEE International Parallel and …, 2020 - ieeexplore.ieee.org
Recently, Intel released the oneAPI programming environment. With Data Parallel C++
(DPC++), oneAPI enables codes to target multiple hardware architectures like multi-core …
(DPC++), oneAPI enables codes to target multiple hardware architectures like multi-core …
Enhancing Kokkos with OpenACC
C++ template metaprogramming has emerged as a prominent approach for achieving
performance portability in heterogeneous computing. Kokkos represents a notable paradigm …
performance portability in heterogeneous computing. Kokkos represents a notable paradigm …
Performance Study of GPU applications using SYCL and CUDA on Tesla V100 GPU
SYCL standard enables single-source programs to run on heterogeneous platforms
consisting of CPUs, GPUs, FPGAs across different hardware vendors. SYCL combines …
consisting of CPUs, GPUs, FPGAs across different hardware vendors. SYCL combines …
Performance portability study of epistasis detection using sycl on nvidia gpu
We describe the experience of converting a CUDA implementation of a high-order epistasis
detection algorithm to SYCL. The goals are for our work to be useful to application and …
detection algorithm to SYCL. The goals are for our work to be useful to application and …
A Performance-Portable SYCL Implementation of CRK-HACC for Exascale
The first generation of exascale systems will include a variety of machine architectures,
featuring GPUs from multiple vendors. As a result, many developers are interested in …
featuring GPUs from multiple vendors. As a result, many developers are interested in …
Case study of using Kokkos and SYCL as performance-portable frameworks for Milc-Dslash benchmark on NVIDIA, AMD and Intel GPUs
Six of the top ten supercomputers in the TOP500 list from June 2021 rely on NVIDIA GPUs to
achieve their peak compute bandwidth. With the announcement of Aurora, Frontier, and El …
achieve their peak compute bandwidth. With the announcement of Aurora, Frontier, and El …
KokkACC: enhancing Kokkos with openACC
Template metaprogramming is gaining popularity as a high-level solution for achieving
performance portability on heterogeneous computing resources. Kokkos is a representative …
performance portability on heterogeneous computing resources. Kokkos is a representative …
Outcomes of openMP hackathon: openMP application experiences with the offloading model (part II)
This paper reports on experiences gained and practices adopted when using the latest
features of OpenMP to port a variety of HPC applications and mini-apps based on different …
features of OpenMP to port a variety of HPC applications and mini-apps based on different …
Evaluating cuda portability with hipcl and dpct
HIPCL is expanding the scope of the CUDA portability route from an AMD platform to an
OpenCL platform. In the meantime, the Intel DPC++ Compatibility Tool (DPCT) is migrating a …
OpenCL platform. In the meantime, the Intel DPC++ Compatibility Tool (DPCT) is migrating a …