Adaptive optimization for OpenCL programs on embedded heterogeneous systems

B Taylor, VS Marco, Z Wang - ACM SIGPLAN Notices, 2017 - dl.acm.org
Heterogeneous multi-core architectures consisting of CPUs and GPUs are commonplace in
today's embedded systems. These architectures offer potential for energy efficient computing …

Energy efficiency of load balancing for data-parallel applications in heterogeneous systems

B Pérez, E Stafford, JL Bosque, R Beivide - The Journal of …, 2017 - Springer
The use of heterogeneous systems in supercomputing is on the rise as they improve both
performance and energy efficiency. However, the programming of these machines requires …

A dynamic multi–objective approach for dynamic load balancing in heterogeneous systems

A Cabrera, A Acosta, F Almeida… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Modern standards in High Performance Computing (HPC) have started to consider energy
consumption and power draw as a limiting factor. New and more complex architectures have …

cCUDA: Effective co-scheduling of concurrent kernels on GPUs

SK Shekofteh, H Noori, M Naghibzadeh… - … on Parallel and …, 2019 - ieeexplore.ieee.org
While GPUs are meantime omnipresent for many scientific and technical computations, they
still continue to evolve as processors. An important recent feature is the ability to execute …

Performance evaluation and analysis of sparse matrix and graph kernels on heterogeneous processors

F Zhang, W Liu, N Feng, J Zhai, X Du - CCF Transactions on High …, 2019 - Springer
Heterogeneous processors integrate very distinct compute resources such as CPUs and
GPUs into the same chip, thus can exploit the advantages and avoid disadvantages of those …

Improving the energy efficiency of SMACOF for multidimensional scaling on modern architectures

F Orts, E Filatovas, G Ortega, O Kurasova… - The Journal of …, 2019 - Springer
The reduction of the dimensionality is of great interest in the context of big data processing.
Multidimensional scaling methods (MDS) are techniques for dimensionality reduction, where …

A heuristic technique to improve energy efficiency with dynamic load balancing

A Cabrera, A Acosta, F Almeida, V Blanco - The Journal of …, 2019 - Springer
Heterogeneous computers require a well-distributed workload to operate efficiently. When
possible, this load balancing procedure should redistribute the workload with minimal …

Adaptive partitioning for iterated sequences of irregular OpenCL kernels

P Huchant, D Barthou… - 2018 30th International …, 2018 - ieeexplore.ieee.org
OpenCL defines a common parallel programming language for all devices, although writing
tasks adapted to the devices, managing communication and load-balancing issues are left …

Annotation inference for safety certification of automatically generated code

E Denney, B Fischer - 21st IEEE/ACM International Conference …, 2006 - ieeexplore.ieee.org
Automated code generation is an enabling technology for model-based software
development and promises many benefits, including higher quality and reduced turn-around …

Split wisely: When work partitioning is energy-optimal on heterogeneous hardware

G Mitra, A Haigh, A Varghese, L Angove… - 2016 IEEE 18th …, 2016 - ieeexplore.ieee.org
Heterogeneous System-on-Chip (SoC) processors are increasingly gaining traction in the
High Performance Computing (HPC) community as alternate building blocks for future …