Adaptive parallel applications: from shared memory architectures to fog computing (2002–2022)
G Galante, R da Rosa Righi - Cluster Computing, 2022 - Springer
The evolution of parallel architectures points to dynamic environments where the number of
available resources or configurations may vary during the execution of applications. This …
available resources or configurations may vary during the execution of applications. This …
Artemis: automatic runtime tuning of parallel execution parameters using machine learning
Portable parallel programming models provide the potential for high performance and
productivity, however they come with a multitude of runtime parameters that can have …
productivity, however they come with a multitude of runtime parameters that can have …
Adaptive scheduling of multiprogrammed dynamic-multithreading applications
Modern parallel platforms, such as clouds or servers, are often shared among many different
jobs. However, existing parallel programming runtime systems are designed and optimized …
jobs. However, existing parallel programming runtime systems are designed and optimized …
iMLBench: A machine learning benchmark suite for CPU-GPU integrated architectures
C Zhang, F Zhang, X Guo, B He… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Utilizing heterogeneous accelerators, especially GPUs, to accelerate machine learning
tasks has shown to be a great success in recent years. GPUs bring huge performance …
tasks has shown to be a great success in recent years. GPUs bring huge performance …
Bandwidth-aware page placement in numa
Page placement is a critical problem for memory-intensive applications running on a shared-
memory multiprocessor with a non-uniform memory access (NUMA) architecture. State-of …
memory multiprocessor with a non-uniform memory access (NUMA) architecture. State-of …
An Exploration of Global Optimization Strategies for Autotuning OpenMP-based Codes
G Bolet, G Georgakoudis, K Parasyris… - 2024 IEEE …, 2024 - ieeexplore.ieee.org
Automatic parameter tuning of parallel codes is ubiquitous in today's HPC environments
where the performance portability of said codes is expected to keep pace with the perpetual …
where the performance portability of said codes is expected to keep pace with the perpetual …
A low-latency and flexible TDM NoC for strong isolation in security-critical systems
MG Alonso, J Flich, M Turki… - 2019 IEEE 13th …, 2019 - ieeexplore.ieee.org
Shared security-critical systems are typically organized as a set of domains that must be kept
separate. The network-on-chip (NoC) is key to delivering strong domain isolation, since …
separate. The network-on-chip (NoC) is key to delivering strong domain isolation, since …
Online learning for dynamic control of openmp workloads
M Mirka, G Sassatelli, A Gamatié - 2020 9th International …, 2020 - ieeexplore.ieee.org
Optimizing energy-efficiency of modern multicore compute systems through online control is
often regarded as both promising and challenging. In this paper, we propose a dynamic …
often regarded as both promising and challenging. In this paper, we propose a dynamic …
Resource Allocation for Data-Intensive Services in the Cloud
DD Gureya - 2021 - diva-portal.org
Cloud computing has become ubiquitous due to its resource flexibility and cost efficiency.
Resource flexibility allows Cloud users to elastically scale their Cloud resources, for …
Resource flexibility allows Cloud users to elastically scale their Cloud resources, for …
Interference analysis and resource management in server processors: from HPC to cloud computing
L Pons Escat - 2023 - riunet.upv.es
[EN] One of the main concerns of today's data centers is to maximize server utilization. In
each server processor, multiple applications are executed concurrently, increasing resource …
each server processor, multiple applications are executed concurrently, increasing resource …