Evaluating performance and portability of high-level programming models: Julia, Python/Numba, and Kokkos on exascale nodes
We explore the performance and portability of the high-level programming models: the
LLVM-based Julia and Python/Numba, and Kokkos on high-performance computing (HPC) …
Evaluation of performance portability of applications and mini-apps across AMD, Intel and NVIDIA GPUs
This paper will evaluate the progress being made on achieving performance portability by a
sub-set of ECP applications, or their related mini-apps, across a diverse spectrum of …
Revisiting a metric for performance portability
Our previously proposed metric for performance portability has been successful in spurring
the development of tools and encouraging fair and meaningful evaluations of applications …
Interpreting and visualizing performance portability metrics
Recent work has introduced a number of tools and techniques for reasoning about the
interplay between application performance and portability, or "performance portability" …
A comparison of two performance portability metrics
A Marowka - Concurrency and Computation: Practice and …, 2023 - Wiley Online Library
The rise in the demand for new performance portability frameworks for heterogeneous
computing systems has brought with it a number of proposals of workable metrics for …
Enabling performance portability on the LiGen drug discovery pipeline
In recent years, there has been a growing interest in developing high-performance
implementations of drug discovery processing software. To target modern GPU …
PETSc/TAO developments for GPU-based early exascale systems
The Portable Extensible Toolkit for Scientific Computation (PETSc) library provides scalable
solvers for nonlinear time-dependent differential and algebraic equations and for numerical …
Performance portability of sparse block diagonal matrix multiple vector multiplications on GPUs
The emergence of accelerator-based computer architectures and programming models
makes it challenging to achieve performance portability for large-scale scientific simulation …
Optimizing performance and energy efficiency in massively parallel systems
R Nozal - 2022 - repositorio.unican.es
Heterogeneous systems are becoming increasingly relevant, due to their performance and
energy efficiency capabilities, being present in all types of computing platforms, from …
A comprehensive modeling approach for the task mapping problem in heterogeneous systems with dataflow processing units
M Wilhelm, H Geppert, A Drewes… - Concurrency and …, 2023 - Wiley Online Library
We introduce a new model for the task mapping problem to aid in the systematic design of
algorithms for heterogeneous systems including, but not limited to, CPUs, GPUs, and …