Using meta-heuristics and machine learning for software optimization of parallel computing systems: a systematic literature review

S Memeti, S Pllana, A Binotto, J Kołodziej, I Brandic - Computing, 2019‏ - Springer
While modern parallel computing systems offer high performance, utilizing these powerful
computing resources to the highest possible extent demands advanced knowledge of …

A review of machine learning and meta-heuristic methods for scheduling parallel computing systems

S Memeti, S Pllana, A Binotto, J Kołodziej… - Proceedings of the …, 2018‏ - dl.acm.org
Optimized software execution on parallel computing systems demands consideration of
many parameters at run-time. Determining the optimal set of parameters in a given …

Adaptive load balancing based on machine learning for iterative parallel applications

CRAV Oikawa, V Freitas, M Castro… - 2020 28th Euromicro …, 2020‏ - ieeexplore.ieee.org
The performance of irregular scientific applications can be easily affected by an uneven
distribution of work among the computing resources. In this context, Load Balancing (LB) …

Client-side reconstruction of composite mementos using serviceworker

S Alam, M Kelly, MC Weigle… - 2017 ACM/IEEE Joint …, 2017‏ - ieeexplore.ieee.org
We use the ServiceWorker (SW) API to intercept HTTP requests for embedded resources
and reconstruct Composite Mementos without the need for conventional URL rewriting …

Adaptive thread map** strategies for transactional memory applications

M Castro, LFW Góes, JF Méhaut - Journal of Parallel and Distributed …, 2014‏ - Elsevier
Transactional Memory (TM) is a programmer friendly alternative to traditional lock-based
concurrency. Although it intends to simplify concurrent programming, the performance of the …

[PDF][PDF] Evaluating CPU and memory affinity for numerical scientific multithreaded benchmarks on multi-cores

CP Ribeiro, M Castro… - … Journal on Computer …, 2012‏ - iadisportal.org
Modern multi-core platforms feature complex topologies with different cache levels and
hierarchical memory subsystems. Consequently, thread and data placement become crucial …

Automatic skeleton-driven memory affinity for transactional worklist applications

LFW Góes, CP Ribeiro, M Castro, JF Méhaut… - International journal of …, 2014‏ - Springer
Memory affinity has become a key element to achieve scalable performance on multi-core
platforms. Mechanisms such as thread scheduling, page allocation and cache prefetching …
[Free GPT-4]
[DeepSeek]
EHM Cruz, M Diener… - … and Computation: Practice …, 2015‏ - Wiley Online Library
Threads of parallel applications need to communicate in order to fulfill their tasks. The
communication performance between the cores in modern multi‐core architectures differs …