State-of-the-art in heterogeneous computing

AR Brodtkorb, C Dyken, TR Hagen… - Scientific …, 2010 - content.iospress.com
Node level heterogeneous architectures have become attractive during the last decade for
several reasons: compared to traditional symmetric CPUs, they offer high peak performance …

A dependency-aware task-based programming environment for multi-core architectures

JM Perez, RM Badia, J Labarta - 2008 IEEE international …, 2008 - ieeexplore.ieee.org
Parallel programming on SMP and multi-core architectures is hard. In this paper we present
a programming model for those environments based on automatic function level parallelism …

Productive programming of GPU clusters with OmpSs

J Bueno, J Planas, A Duran, RM Badia… - 2012 IEEE 26th …, 2012 - ieeexplore.ieee.org
Clusters of GPUs are emerging as a new computational scenario. Programming them
requires the use of hybrid models that increase the complexity of the applications, reducing …

Hierarchical task-based programming with StarSs

J Planas, RM Badia, E Ayguadé… - … International Journal of …, 2009 - journals.sagepub.com
Programming models for multicore and many-core systems are listed as one of the main
challenges in the near future for computing research. These programming models should be …

An extension of the StarSs programming model for platforms with multiple GPUs

E Ayguadé, RM Badia, FD Igual, J Labarta… - Euro-Par 2009 Parallel …, 2009 - Springer
While general-purpose homogeneous multi-core architectures are becoming ubiquitous,
there are clear indications that, for a number of important applications, a better …

Productive cluster programming with OmpSs

J Bueno, L Martinell, A Duran, M Farreras… - Euro-Par 2011 Parallel …, 2011 - Springer
Clusters of SMPs are ubiquitous. They have been traditionally programmed by using MPI.
But, the productivity of MPI programmers is low because of the complexity of expressing …

An algorithm for the optimal control of the driving of trains

R Franke, P Terwiesch, M Meyer - Proceedings of the 39th IEEE …, 2000 - ieeexplore.ieee.org
We discuss an algorithm that optimizes the driving style of a train. The objective is to
minimize the electrical energy used for traction subject to constraints such as the travel time …

Scheduling dense linear algebra operations on multicore processors

J Kurzak, H Ltaief, J Dongarra… - … Practice and Experience, 2010 - Wiley Online Library
State‐of‐the‐art dense linear algebra software, such as the LAPACK and ScaLAPACK
libraries, suffers performance losses on multicore processors due to their inability to fully …

Parallel programming models for dense linear algebra on heterogeneous systems

J Dongarra, M Abalenkovs, A Abdelfattah… - Supercomputing …, 2015 - superfri.susu.ru
We present a review of the current best practices in parallel programming models for dense
linear algebra (DLA) on heterogeneous architectures. We consider multicore CPUs, stand …

Contention-aware fair scheduling for asymmetric single-ISA multicore systems

A Garcia-Garcia, JC Saez… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
Asymmetric single-ISA multicore processors (AMPs), which integrate high-performance big
cores and low-power small cores, were shown to deliver higher performance per watt than …