Programming for exascale computers

W Gropp, M Snir - Computing in Science & Engineering, 2013 - ieeexplore.ieee.org
Exascale systems will present programmers with many challenges. The authors review the
parallel programming models that are appropriate for such systems and the challenges that …

Process-in-process: techniques for practical address-space sharing

A Hori, M Si, B Gerofi, M Takagi, J Dayal… - Proceedings of the 27th …, 2018 - dl.acm.org
The two most common parallel execution models for many-core CPUs today are
multiprocess (eg, MPI) and multithread (eg, OpenMP). The multiprocess model allows each …

MPI 3 and beyond: why MPI is successful and what challenges it faces

W Gropp - European MPI Users' Group Meeting, 2012 - Springer
Abstract The Message Passing Interface (MPI) was developed over eighteen years ago and
continues to be the preferred programming model for scientific computing. Contributing to …

Frequency enhancement of digital VLSI test systems

L Ackner, MR Barber - Proceedings. International Test …, 1990 - ieeexplore.ieee.org
The authors first discuss the intrinsic frequency limitations of the wave formatters, DUT
(device-under-test) drivers, and comparators in a typical 1980s 40-80-MHz test system, and …

Memory compression techniques for network address management in MPI

Y Guo, CJ Archer, M Blocksome… - 2017 IEEE …, 2017 - ieeexplore.ieee.org
MPI allows applications to treat processes as a logical collection of integer ranks for each
MPI communicator, while internally translating these logical ranks into actual network …

Engineering algorithms for scalability through continuous validation of performance expectations

S Shudler, Y Berens, A Calotoiu… - … on Parallel and …, 2019 - ieeexplore.ieee.org
Many libraries in the HPC field use sophisticated algorithms with clear theoretical scalability
expectations. However, hardware constraints or programming bugs may sometimes render …

Software-managed cache coherence for fast one-sided communication

S Christgau, B Schnor - Proceedings of the 7th International Workshop …, 2016 - dl.acm.org
The ongoing many-core design aims at core counts where cache coherence becomes a
serious challenge. Therefore, this paper discusses how one-sided communication can be …

A scalable approach to visualization of large virtual cities

J Zara, P Chromy, J Cizek, K Ghais… - Proceedings Fifth …, 2001 - ieeexplore.ieee.org
Visualization of large urban complexes on the Web is a highly demanding task both from the
networking and computational point of view. The whole three-dimensional model of a city is …

Optimisation of the parallel performances of a CFD solver for emerging computational platforms

F Gava - 2022 - theses.hal.science
The importance of Computational Fluid Dynamics in the industrial design process has
increased dramatically in the last two decades. This is mainly due to the technological …

Runtime techniques for efficient execution of virtualized, migratable MPI ranks

S White - 2022 - ideals.illinois.edu
Abstract The Message Passing Interface (MPI) is the dominant programming system for
scientific applications that run on distributed memory parallel computers. MPI is a library …