Benchmarking machine learning methods for performance modeling of scientific applications

P Malakar, P Balaprakash… - 2018 IEEE/ACM …, 2018 - ieeexplore.ieee.org
Performance modeling is an important and active area of research in high-performance
computing (HPC). It helps in better job scheduling and also improves overall performance of …

Filtering methods for subgraph matching on multiplex networks

JD Moorman, Q Chen, TK Tu, ZM Boyd… - … conference on big …, 2018 - ieeexplore.ieee.org
We present filtering methods for finding all sub-graphs of a large multiplex network that are
isomorphic to a smaller template network. These methods are shown to be effective on a set …

Exascale design space exploration and co-design

SS Dosanjh, RF Barrett, DW Doerfler… - Future Generation …, 2014 - Elsevier
The co-design of architectures and algorithms has been postulated as a strategy for
achieving Exascale computing in this decade. Exascale design space exploration is …

Optimal execution of co-analysis for large-scale molecular dynamics simulations

P Malakar, V Vishwanath, C Knight… - SC'16: Proceedings …, 2016 - ieeexplore.ieee.org
The analysis of scientific simulation data enables scientists to derive insights from their
simulations. This analysis of the simulation output can be performed at the same execution …

Achieving portability and performance through OpenACC

JA Herdman, WP Gaudin, O Perks… - 2014 First Workshop …, 2014 - ieeexplore.ieee.org
OpenACC is a directive-based programming model designed to allow easy access to
emerging advanced architecture systems for existing production codes based on Fortran, C …

The NOMAD mini-apps: A suite of kernels from ab initio electronic structure codes enabling co-design in high-performance computing

IM Magre, RG Torres, JMC Espín… - Open Research …, 2024 - pmc.ncbi.nlm.nih.gov
This article introduces a suite of mini-applications (mini-apps) designed to optimise
computational kernels in ab initio electronic structure codes. The suite is developed from …

Double-precision fpus in high-performance computing: an embarrassment of riches?

J Domke, K Matsumura, M Wahib… - 2019 IEEE …, 2019 - ieeexplore.ieee.org
Among the (uncontended) common wisdom in High-Performance Computing (HPC) is the
applications' need for large amount of double-precision support in hardware. Hardware …

Toward an evolutionary task parallel integrated MPI+ X programming model

RF Barrett, DT Stark, CT Vaughan, RE Grant… - Proceedings of the …, 2015 - dl.acm.org
The Bulk Synchronous Parallel programming model is showing performance limitations at
high processor counts. We propose over-decomposition of the domain, operated on as …

Early experiences co-scheduling work and communication tasks for hybrid MPI+ X applications

DT Stark, RF Barrett, RE Grant… - 2014 Workshop on …, 2014 - ieeexplore.ieee.org
Advances in node-level architecture and interconnect technology needed to reach extreme
scale necessitate a reevaluation of long-standing models of computation, in particular bulk …

miniVite: A graph analytics benchmarking tool for massively parallel systems

S Ghosh, M Halappanavar, A Tumeo… - 2018 IEEE/ACM …, 2018 - ieeexplore.ieee.org
Benchmarking of high performance computing systems can help provide critical insights for
efficient design of computing systems and software applications. Although a large number of …