[PDF][PDF] Cloverleaf: Preparing hydrodynamics codes for exascale

A Mallinson, DA Beckingsale, W Gaudin, J Herdman… - The Cray User …, 2013 - cug.org
▪ The C bindings make interfacing with Fortran difficult▪ Global class implemented to
coordinate data transfers with and computation on the GPU▪ Data created and initialised on …

Programming for exascale computers

W Gropp, M Snir - Computing in Science & Engineering, 2013 - ieeexplore.ieee.org
Exascale systems will present programmers with many challenges. The authors review the
parallel programming models that are appropriate for such systems and the challenges that …

A parallel competitive Particle Swarm Optimization for non-linear first arrival traveltime tomography and uncertainty quantification

K Luu, M Noble, A Gesret, N Belayouni… - Computers & Geosciences, 2018 - Elsevier
Seismic traveltime tomography is an optimization problem that requires large computational
efforts. Therefore, linearized techniques are commonly used for their low computational cost …

Performance management of accelerated mapreduce workloads in heterogeneous clusters

J Polo, D Carrera, Y Becerra, V Beltran… - 2010 39th …, 2010 - ieeexplore.ieee.org
Next generation data centers will be composed of thousands of hybrid systems in an attempt
to increase overall cluster performance and to minimize energy consumption. New …

Performance modeling of communication and computation in hybrid MPI and OpenMP applications

L Adhianto, B Chapman - Simulation Modelling Practice and Theory, 2007 - Elsevier
Performance evaluation and modeling are crucial steps to enabling the optimization of
parallel programs. Programs written using two programming models, such as MPI and …

Pencil: A pipelined algorithm for distributed stencils

H Wang… - … Conference for High …, 2020 - ieeexplore.ieee.org
Stencil computations are at the core of various Computational Fluid Dynamics (CFD)
applications and have been well-studied for several decades. Typically they're highly …

Comparison between pure MPI and hybrid MPI-OpenMP parallelism for Discrete Element Method (DEM) of ellipsoidal and poly-ellipsoidal particles

B Yan, RA Regueiro - Computational Particle Mechanics, 2019 - Springer
Parallel computing of 3D Discrete Element Method (DEM) simulations can be achieved in
different modes, and two of them are pure MPI and hybrid MPI-OpenMP. The hybrid MPI …

HBPFP-DC: A parallel frequent itemset mining using Spark

Y Xun, J Zhang, H Yang, X Qin - Parallel Computing, 2021 - Elsevier
The frequent itemset mining (FIM) is one of the most important techniques to extract
knowledge from data in many real-world applications. Facing big data applications, parallel …

MPI collectives for multi-core clusters: Optimized performance of the hybrid MPI+ MPI parallel codes

H Zhou, J Gracia, R Schneider - … of the 48th International Conference on …, 2019 - dl.acm.org
The advent of multi-/many-core processors in clusters advocates hybrid parallel
programming, which combines Message Passing Interface (MPI) for inter-node parallelism …

The tiny-tasks granularity trade-off: Balancing overhead versus performance in parallel systems

S Bora, B Walker, M Fidler - IEEE Transactions on Parallel and …, 2023 - ieeexplore.ieee.org
Models of parallel processing systems typically assume that one has workers and jobs are
split into an equal number of tasks. Splitting jobs into smaller tasks, ie using “tiny tasks”, can …