Benchmarking machine learning methods for performance modeling of scientific applications
Performance modeling is an important and active area of research in high-performance
computing (HPC). It helps in better job scheduling and also improves overall performance of …
computing (HPC). It helps in better job scheduling and also improves overall performance of …
Filtering methods for subgraph matching on multiplex networks
We present filtering methods for finding all sub-graphs of a large multiplex network that are
isomorphic to a smaller template network. These methods are shown to be effective on a set …
isomorphic to a smaller template network. These methods are shown to be effective on a set …
Exascale design space exploration and co-design
SS Dosanjh, RF Barrett, DW Doerfler… - Future Generation …, 2014 - Elsevier
The co-design of architectures and algorithms has been postulated as a strategy for
achieving Exascale computing in this decade. Exascale design space exploration is …
achieving Exascale computing in this decade. Exascale design space exploration is …
Optimal execution of co-analysis for large-scale molecular dynamics simulations
The analysis of scientific simulation data enables scientists to derive insights from their
simulations. This analysis of the simulation output can be performed at the same execution …
simulations. This analysis of the simulation output can be performed at the same execution …
Achieving portability and performance through OpenACC
JA Herdman, WP Gaudin, O Perks… - 2014 First Workshop …, 2014 - ieeexplore.ieee.org
OpenACC is a directive-based programming model designed to allow easy access to
emerging advanced architecture systems for existing production codes based on Fortran, C …
emerging advanced architecture systems for existing production codes based on Fortran, C …
The NOMAD mini-apps: A suite of kernels from ab initio electronic structure codes enabling co-design in high-performance computing
IM Magre, RG Torres, JMC Espín… - Open Research …, 2024 - pmc.ncbi.nlm.nih.gov
This article introduces a suite of mini-applications (mini-apps) designed to optimise
computational kernels in ab initio electronic structure codes. The suite is developed from …
computational kernels in ab initio electronic structure codes. The suite is developed from …
Double-precision fpus in high-performance computing: an embarrassment of riches?
Among the (uncontended) common wisdom in High-Performance Computing (HPC) is the
applications' need for large amount of double-precision support in hardware. Hardware …
applications' need for large amount of double-precision support in hardware. Hardware …
Toward an evolutionary task parallel integrated MPI+ X programming model
The Bulk Synchronous Parallel programming model is showing performance limitations at
high processor counts. We propose over-decomposition of the domain, operated on as …
high processor counts. We propose over-decomposition of the domain, operated on as …
Early experiences co-scheduling work and communication tasks for hybrid MPI+ X applications
Advances in node-level architecture and interconnect technology needed to reach extreme
scale necessitate a reevaluation of long-standing models of computation, in particular bulk …
scale necessitate a reevaluation of long-standing models of computation, in particular bulk …
miniVite: A graph analytics benchmarking tool for massively parallel systems
Benchmarking of high performance computing systems can help provide critical insights for
efficient design of computing systems and software applications. Although a large number of …
efficient design of computing systems and software applications. Although a large number of …