Bi-objective optimization of data-parallel applications on heterogeneous HPC platforms for performance and energy through workload distribution

H Khaleghzadeh, M Fahad, A Shahid… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Performance and energy are the two most important objectives for optimization on modern
parallel platforms. In this article, we show that moving from single-objective optimization for …

New model-based methods and algorithms for performance and energy optimization of data parallel applications on homogeneous multicore clusters

A Lastovetsky, RR Manumachu - IEEE Transactions on Parallel …, 2016 - ieeexplore.ieee.org
Modern homogeneous parallel platforms are composed of tightly integrated multicore CPUs.
This tight integration has resulted in the cores contending for various shared on-chip …

[KNIHA][B] Parallel algorithms

H Casanova, A Legrand, Y Robert - 2008 - taylorfrancis.com
Focusing on algorithms for distributed-memory parallel architectures, Parallel Algorithms
presents a rigorous yet accessible treatment of theoretical models of parallel computation …

A novel data-partitioning algorithm for performance optimization of data-parallel applications on heterogeneous HPC platforms

H Khaleghzadeh, RR Manumachu… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
Modern HPC platforms have become highly heterogeneous owing to tight integration of
multicore CPUs and accelerators (such as Graphics Processing Units, Intel Xeon Phis, or …

Data partitioning with a functional performance model of heterogeneous processors

A Lastovetsky, R Reddy - The International Journal of High …, 2007 - journals.sagepub.com
In this paper, we address the problem of optimal distribution of computational tasks on a
network of heterogeneous computers when one or more tasks do not fit into the main …

[PDF][PDF] An overview of heterogeneous high performance and grid computing

J Dongarra, A Lastovetsky - Engineering the Grid: Status and …, 2006 - academia.edu
This paper is an overview the ongoing academic research, development, and uses of
heterogeneous parallel and distributed computing. This work is placed in the context of …

The master-slave paradigm with heterogeneous processors

O Beaumont, A Legrand… - IEEE Transactions on …, 2003 - ieeexplore.ieee.org
We revisit the master-slave tasking paradigm in the context of heterogeneous processors.
We assume that communications are handled by a bus and, therefore, at most one …

[KNIHA][B] High performance heterogeneous computing

J Dongarra, AL Lastovetsky - 2009 - books.google.com
An analytical overview of the state of the art, open problems, and future trends in
heterogeneous parallel and distributed computing This book provides an overview of the …

Partitioning a square into rectangles: NP-completeness and approximation algorithms

Beaumont, Boudet, Rastello - Algorithmica, 2002 - Springer
In this paper we deal with two geometric problems arising from heterogeneous parallel
computing: how to partition the unit square into p rectangles of given area s 1, s 2,..., sp …

Matrix multiplication on high-density multi-GPU architectures: theoretical and experimental investigations

P Zhang, Y Gao - … Computing: 30th International Conference, ISC High …, 2015 - Springer
Matrix multiplication (MM) is one of the core problems in the high performance computing
domain and its efficiency impacts performances of almost all matrix problems. The high …