Adaptive parallel applications: from shared memory architectures to fog computing (2002–2022)

G Galante, R da Rosa Righi - Cluster Computing, 2022 - Springer
The evolution of parallel architectures points to dynamic environments where the number of
available resources or configurations may vary during the execution of applications. This …

Artemis: automatic runtime tuning of parallel execution parameters using machine learning

C Wood, G Georgakoudis, D Beckingsale… - … Conference, ISC High …, 2021 - Springer
Portable parallel programming models provide the potential for high performance and
productivity, however they come with a multitude of runtime parameters that can have …

Adaptive scheduling of multiprogrammed dynamic-multithreading applications

Z Wang, C Xu, K Agrawal, J Li - Journal of Parallel and Distributed …, 2022 - Elsevier
Modern parallel platforms, such as clouds or servers, are often shared among many different
jobs. However, existing parallel programming runtime systems are designed and optimized …

iMLBench: A machine learning benchmark suite for CPU-GPU integrated architectures

C Zhang, F Zhang, X Guo, B He… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Utilizing heterogeneous accelerators, especially GPUs, to accelerate machine learning
tasks has shown to be a great success in recent years. GPUs bring huge performance …

Bandwidth-aware page placement in numa

D Gureya, J Neto, R Karimi, J Barreto… - 2020 IEEE …, 2020 - ieeexplore.ieee.org
Page placement is a critical problem for memory-intensive applications running on a shared-
memory multiprocessor with a non-uniform memory access (NUMA) architecture. State-of …

An Exploration of Global Optimization Strategies for Autotuning OpenMP-based Codes

G Bolet, G Georgakoudis, K Parasyris… - 2024 IEEE …, 2024 - ieeexplore.ieee.org
Automatic parameter tuning of parallel codes is ubiquitous in today's HPC environments
where the performance portability of said codes is expected to keep pace with the perpetual …

A low-latency and flexible TDM NoC for strong isolation in security-critical systems

MG Alonso, J Flich, M Turki… - 2019 IEEE 13th …, 2019 - ieeexplore.ieee.org
Shared security-critical systems are typically organized as a set of domains that must be kept
separate. The network-on-chip (NoC) is key to delivering strong domain isolation, since …

Online learning for dynamic control of openmp workloads

M Mirka, G Sassatelli, A Gamatié - 2020 9th International …, 2020 - ieeexplore.ieee.org
Optimizing energy-efficiency of modern multicore compute systems through online control is
often regarded as both promising and challenging. In this paper, we propose a dynamic …

Resource Allocation for Data-Intensive Services in the Cloud

DD Gureya - 2021 - diva-portal.org
Cloud computing has become ubiquitous due to its resource flexibility and cost efficiency.
Resource flexibility allows Cloud users to elastically scale their Cloud resources, for …

Interference analysis and resource management in server processors: from HPC to cloud computing

L Pons Escat - 2023 - riunet.upv.es
[EN] One of the main concerns of today's data centers is to maximize server utilization. In
each server processor, multiple applications are executed concurrently, increasing resource …