ECP software technology capability assessment report

MA Heroux, LC McInnes, R Thakur, JS Vetter, XS Li… - 2020 - osti.gov
The Exascale Computing Project (ECP) Software Technology (ST) Focus Area is
responsible for developing critical software capabilities that will enable successful execution …

Give MPI threading a fair chance: A study of multithreaded MPI designs

T Patinyasakdikul, D Eberius… - … Conference on Cluster …, 2019 - ieeexplore.ieee.org
The Message Passing Interface (MPI) has been one of the most prominent programming
paradigms in high-performance computing (HPC) for the past decade. Lately, with changes …

Process-in-process: techniques for practical address-space sharing

A Hori, M Si, B Gerofi, M Takagi, J Dayal… - Proceedings of the 27th …, 2018 - dl.acm.org
The two most common parallel execution models for many-core CPUs today are
multiprocess (e.g., MPI) and multithread (e.g., OpenMP). The multiprocess model allows each …

A lightweight communication runtime for distributed graph analytics

HV Dang, R Dathathri, G Gill, A Brooks… - 2018 IEEE …, 2018 - ieeexplore.ieee.org
Distributed-memory multi-core clusters enable in-memory processing of very large graphs
with billions of nodes and edges. Recent distributed graph analytics systems have been built …

Lock contention management in multithreaded MPI

A Amer, H Lu, P Balaji, M Chabbi, Y Wei… - ACM Transactions on …, 2019 - dl.acm.org
In this article, we investigate contention management in lock-based thread-safe MPI
libraries. Specifically, we make two assumptions: (1) locks are the only form of …

Optimizing point‐to‐point communication between adaptive MPI endpoints in shared memory

S White, LV Kale - Concurrency and Computation: Practice and …, 2020 - Wiley Online Library
Adaptive MPI is an implementation of the MPI standard that supports the virtualization of
ranks as user‐level threads, rather than OS processes. In this work, we optimize the …

Software combining to mitigate multithreaded MPI contention

A Amer, C Archer, M Blocksome, C Cao… - Proceedings of the …, 2019 - dl.acm.org
Efforts to mitigate lock contention from concurrent threaded accesses to MPI have reduced
contention through fine-grained locking, avoided locking altogether by offloading …

Techniques for representing and processing geometry within a graphics processing pipeline

Z Hakura, Y Uralsky, C Kubisch, P Boudier… - US Patent …, 2020 - Google Patents
In various embodiments, a parallel processor implements a graphics processing pipeline
that generates rendered images via a shading program. In operation, the parallel processor …

Using simulation to examine the effect of MPI message matching costs on application performance

S Levy, KB Ferreira - Proceedings of the 25th European MPI Users' …, 2018 - dl.acm.org
Attaining high performance with MPI applications requires efficient message matching to
minimize message processing overheads and the latency these overheads introduce into …

Using relaxed concurrent data structures for contention minimization in multithreaded MPI programs

AV Tabakov, AA Paznikov - Journal of Physics: Conference …, 2019 - iopscience.iop.org
Parallel computing is one of the top priorities in computer science. The main means of
parallel processing information is a distributed computing system (CS): a composition of …