- Academic Search

Scaling irregular applications through data aggregation and software multithreading

Rechercher parmi les articles qui s'y rapportent

Turnitin 降AI改写早检测系统早降重系统 Turnitin-UK版万方检测-期刊版维普编辑部版 Grammarly检测 Paperpass检测 checkpass检测 PaperYY检测

Acceleration of graph neural network-based prediction models in chemistry via co-design optimization on intelligence processing units

H Helal, J Firoz, JA Bilbrey, H Sprueill… - Journal of Chemical …, 2024 - ACS Publications

Atomic structure prediction and associated property calculations are the bedrock of chemical
physics. Since high-fidelity ab initio modeling techniques for computing the structure and …

Enregistrer Citer Cité 5 fois Autres articles Les 5 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] acm.org Full View

Asynchronous Memory Access Unit: Exploiting Massive Parallelism for Far Memory Access

L Wang, X Zhang, S Wang, Z Jiang, T Lu… - ACM Transactions on …, 2024 - dl.acm.org

The growing memory demands of modern applications have driven the adoption of far
memory technologies in data centers to provide cost-effective, high-capacity memory …

Enregistrer Citer Cité 1 fois Autres articles Les 4 versions Free GPT-4 DeepSeek

In-memory graph databases for web-scale data

VG Castellana, A Morari, J Weaver, A Tumeo… - Computer, 2015 - ieeexplore.ieee.org

In-Memory Graph Databases for Web-Scale Data Page 1 24 COMPUTER PUBLISHED BY THE
IEEE COMPUTER SOCIETY 0018-9162/15/$31.00 © 2015 IEEE COVER FEATURE BIG DATA …

Enregistrer Citer Cité 33 fois Autres articles Les 4 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Itoyori: Reconciling global address space and global fork-join task parallelism

S Shiina, K Taura - Proceedings of the International Conference for High …, 2023 - dl.acm.org

This paper introduces Itoyori, a task-parallel runtime system designed to tackle the
challenge of scaling task parallelism (more specifically, nested fork-join parallelism) beyond …

Enregistrer Citer Cité 1 fois Autres articles Les 4 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] audentia-gestion.fr

Caching puts and gets in a PGAS language runtime

MP Ferguson, D Buettner - 2015 9th International Conference …, 2015 - ieeexplore.ieee.org

We investigated a software cache for PGAS PUT and GET operations. The cache is
implemented as a software write-back cache with dirty bits, local memory consistency …

Enregistrer Citer Cité 21 fois Autres articles Les 9 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] pnnl.gov

Shad: The scalable high-performance algorithms and data-structures library

VG Castellana, M Minutoli - 2018 18th IEEE/ACM International …, 2018 - ieeexplore.ieee.org

The unprecedented amount of data that needs to be processed in emerging data analytics
applications poses novel challenges to industry and academia. Scalability and high …

Enregistrer Citer Cité 13 fois Autres articles Les 5 versions Free GPT-4 DeepSeek

Practical distributed programming in c++

M Drocco, VG Castellana, M Minutoli - Proceedings of the 29th …, 2020 - dl.acm.org

The need for coupling high performance with productivity is steering the recent evolution of
the C++ language where low-level aspects of parallel and distributed computing are now …

Enregistrer Citer Cité 9 fois Autres articles Les 2 versions Free GPT-4 DeepSeek

Graphine: Programming graph-parallel computation of large natural graphs for multicore clusters

J Yan, G Tan, Z Mo, N Sun - IEEE Transactions on Parallel and …, 2015 - ieeexplore.ieee.org

Graph-parallel computation has become a crucial component in emerging applications of
web search, data analytics and machine learning. In practice, most graphs derived from real …

Enregistrer Citer Cité 18 fois Autres articles Les 3 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] wisc.edu

Gravel: Fine-grain gpu-initiated network messages

MS Orr, S Che, BM Beckmann, M Oskin… - Proceedings of the …, 2017 - dl.acm.org

Distributed systems incorporate GPUs because they provide massive parallelism in an
energy-efficient manner. Unfortunately, existing programming models make it difficult to …

Enregistrer Citer Cité 10 fois Autres articles Les 5 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] osti.gov

Extending openshmem with aggregation support for improved message rate performance

A Welch, O Hernandez, S Poole - European Conference on Parallel …, 2023 - Springer

OpenSHMEM is a highly efficient one-sided communication API that implements the PGAS
parallel programming model, and is known for its low latency communication operations that …

Enregistrer Citer Cité 1 fois Autres articles Les 5 versions Free GPT-4 DeepSeek

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

Scaling irregular applications through data aggregation and software multithreading

Acceleration of graph neural network-based prediction models in chemistry via co-design optimization on intelligence processing units

Asynchronous Memory Access Unit: Exploiting Massive Parallelism for Far Memory Access

In-memory graph databases for web-scale data

Itoyori: Reconciling global address space and global fork-join task parallelism

Caching puts and gets in a PGAS language runtime

Shad: The scalable high-performance algorithms and data-structures library

Practical distributed programming in c++

Graphine: Programming graph-parallel computation of large natural graphs for multicore clusters

Gravel: Fine-grain gpu-initiated network messages

Extending openshmem with aggregation support for improved message rate performance