- Academic Search

M Diener, EHM Cruz, MAZ Alves, POA Navaux… - ACM Computing …, 2016 - dl.acm.org

Shared memory architectures have recently experienced a large increase in thread-level
parallelism, leading to complex memory hierarchies with multiple cache memory levels and …

Uložit Citovat Počet citací tohoto článku: 54 Související články Všechny verze (počet: 6)

Characterizing communication and page usage of parallel applications for thread and data map**

M Diener, EHM Cruz, LL Pilla, F Dupros… - Performance …, 2015 - Elsevier

The parallelism in shared-memory systems has increased significantly with the advent and
evolution of multicore processors. Current systems include several multicore and …

Uložit Citovat Počet citací tohoto článku: 78 Související články Všechny verze (počet: 6)

[Free GPT-4]

[PDF] academia.edu

kMAF: Automatic kernel-level management of thread and data affinity

M Diener, EHM Cruz, POA Navaux, A Busse… - Proceedings of the 23rd …, 2014 - dl.acm.org

One of the main challenges for parallel architectures is the increasing complexity of the
memory hierarchy, which consists of several levels of private and shared caches, as well as …

Uložit Citovat Počet citací tohoto článku: 58 Související články Všechny verze (počet: 8)

[Free GPT-4]

[PDF] psu.edu

Compiler support for selective page migration in NUMA architectures

G Piccoli, HN Santos, RE Rodrigues, C Pousa… - Proceedings of the 23rd …, 2014 - dl.acm.org

Current high-performance multicore processors provide users with a non-uniform memory
access model (NUMA). These systems perform better when threads access data on memory …

Uložit Citovat Počet citací tohoto článku: 53 Související články Všechny verze (počet: 8)

[Free GPT-4]

[PDF] researchgate.net

Locality vs. balance: Exploring data map** policies on numa systems

M Diener, EHM Cruz… - 2015 23rd Euromicro …, 2015 - ieeexplore.ieee.org

In parallel architectures that have a Non-Uniform Memory Access (NUMA) behavior, the
map** of memory pages to NUMA nodes influences the performance of parallel …

Uložit Citovat Počet citací tohoto článku: 44 Související články Všechny verze (počet: 5)

[Free GPT-4]

[PDF] academia.edu

Kernel-based thread and data map** for improved memory affinity

M Diener, EHM Cruz, MAZ Alves… - … on Parallel and …, 2015 - ieeexplore.ieee.org

Reducing the cost of memory accesses, both in terms of performance and energy
consumption, is a major challenge in shared-memory architectures. Modern systems have …

Uložit Citovat Počet citací tohoto článku: 34 Související články Všechny verze (počet: 6)

Using machine learning to optimize graph execution on numa machines

HMG de A. Rocha, J Schwarzrock… - Proceedings of the 59th …, 2022 - dl.acm.org

This paper proposes PredG, a Machine Learning framework to enhance the graph
processing performance by finding the ideal thread and data map** on NUMA systems …

Uložit Citovat Počet citací tohoto článku: 8 Související články

Boosting graph analytics by tuning threads and data affinity on numa systems

HMGA Rocha, J Schwarzrock… - 2021 29th Euromicro …, 2021 - ieeexplore.ieee.org

The execution of large real-world graphs, such as web searches and social networks, has
been boosting by modern HPC systems. However, their irregular communication patterns …

Uložit Citovat Počet citací tohoto článku: 12 Související články Všechny verze (počet: 2)

Effective exploration of thread throttling and thread/page map** on numa systems

J Schwarzrock, HMGA Rocha… - 2020 IEEE 22nd …, 2020 - ieeexplore.ieee.org

NUMA systems have become commonly used in HPC. However, to fully take advantage of
these systems, the right thread-to-core allocation and page placement are essential. On top …

Uložit Citovat Počet citací tohoto článku: 15 Související články Všechny verze (počet: 3)

[Free GPT-4]

[PDF] ufpr.br

Dynamic thread map** of shared memory applications by exploiting cache coherence protocols

EHM Cruz, M Diener, MAZ Alves… - Journal of Parallel and …, 2014 - Elsevier

In current computer architectures, the communication performance between threads varies
depending on the memory hierarchy. This performance difference must be considered when …

Uložit Citovat Počet citací tohoto článku: 32 Související články Všechny verze (počet: 8)

Vytvořit upozornění

Citovat

Rozšířené vyhledávání

Uloženo do Mojí knihovny

Communication-based map** using shared pages

Affinity-based thread and data map** in shared memory systems

Characterizing communication and page usage of parallel applications for thread and data map**

kMAF: Automatic kernel-level management of thread and data affinity

Compiler support for selective page migration in NUMA architectures

Locality vs. balance: Exploring data map** policies on numa systems

Kernel-based thread and data map** for improved memory affinity

Using machine learning to optimize graph execution on numa machines

Boosting graph analytics by tuning threads and data affinity on numa systems

Effective exploration of thread throttling and thread/page map** on numa systems

Dynamic thread map** of shared memory applications by exploiting cache coherence protocols