Affinity-based thread and data map** in shared memory systems

M Diener, EHM Cruz, MAZ Alves, POA Navaux… - ACM Computing …, 2016 - dl.acm.org
Shared memory architectures have recently experienced a large increase in thread-level
parallelism, leading to complex memory hierarchies with multiple cache memory levels and …

memif Towards Programming Heterogeneous Memory Asynchronously

FX Lin, X Liu - ACM SIGPLAN Notices, 2016 - dl.acm.org
To harness a heterogeneous memory hierarchy, it is advantageous to integrate application
knowledge in guiding frequent memory move, ie, replicating or migrating virtual memory …

Modeling and optimizing numa effects and prefetching with machine learning

I Sánchez Barrera, D Black-Schaffer, M Casas… - Proceedings of the 34th …, 2020 - dl.acm.org
Both NUMA thread/data placement and hardware prefetcher configuration have significant
impacts on HPC performance. Optimizing both together leads to a large and complex design …

DR-BW: identifying bandwidth contention in NUMA architectures with supervised learning

H Xu, S Wen, A Gimenez, T Gamblin… - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Non-Uniform Memory Access (NUMA) architectures are widely used in mainstream multi-
socket computer systems to scale memory bandwidth. Without a NUMA-aware design …

Numamma: Numa memory analyzer

F Trahay, M Selva, L Morel, K Marquet - Proceedings of the 47th …, 2018 - dl.acm.org
Non Uniform Memory Access (NUMA) architectures are nowadays common for running High-
Performance Computing (HPC) applications. In such architectures, several distinct physical …

Adapt burstable containers to variable CPU resources

H Huang, Y Zhao, J Rao, S Wu, H **… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
In the age of the cloud-native, container technology, referred as OS-level virtualization, is
increasingly adopted to deploy cloud applications. Compared with virtual machines …

Data and thread placement in numa architectures: A statistical learning approach

N Denoyelle, B Goglin, E Jeannot… - Proceedings of the 48th …, 2019 - dl.acm.org
Nowadays, NUMA architectures are common in compute-intensive systems. Achieving high
performance for multi-threaded application requires both a careful placement of threads on …

Reducing data movement on large shared memory systems by exploiting computation dependencies

I Sánchez Barrera, M Moretó, E Ayguadé… - Proceedings of the …, 2018 - dl.acm.org
Shared memory systems are becoming increasingly complex as they typically integrate
several storage devices. That brings different access latencies or bandwidth rates …

Swing to SWT and back: Patterns for API migration by wrap**

TT Bartolomei, K Czarnecki… - 2010 IEEE International …, 2010 - ieeexplore.ieee.org
Evolving requirements may necessitate API migration-re-engineering an application to
replace its dependence on one API with the dependence on another API for the same …

Locality vs. balance: Exploring data map** policies on numa systems

M Diener, EHM Cruz… - 2015 23rd Euromicro …, 2015 - ieeexplore.ieee.org
In parallel architectures that have a Non-Uniform Memory Access (NUMA) behavior, the
map** of memory pages to NUMA nodes influences the performance of parallel …