Trends in data locality abstractions for HPC systems

D Unat, A Dubey, T Hoefler, J Shalf… - … on Parallel and …, 2017 - ieeexplore.ieee.org
The cost of data movement has always been an important concern in high performance
computing (HPC) systems. It has now become the dominant factor in terms of both energy …

DASH: A C++ PGAS library for distributed data structures and parallel algorithms

K Fürlinger, T Fuchs… - 2016 IEEE 18th …, 2016 - ieeexplore.ieee.org
We present DASH, a C++ template library that offers distributed data structures and parallel
algorithms and implements a compiler-free PGAS (partitioned global address space) …

[PDF][PDF] Programming abstractions for data locality

A Tate, A Kamil, A Dubey, A Groblinger, B Chamberlain… - 2014 - repository.kaust.edu.sa
Programming Abstractions for Data Locality Page 1 Programming Abstractions for Data Locality
Item Type Technical Report Authors Tate, Adrian;Kamil, Amir;Dubey, Anshu;Groblinger …

Msl: A synthesis enabled language for distributed implementations

Z Xu, S Kamil, A Solar-Lezama - SC'14: Proceedings of the …, 2014 - ieeexplore.ieee.org
This paper demonstrates how ideas from generative programming and software synthesis
can help support the development of bulk-synchronous distributed memory kernels. These …

Optimizing PGAS overhead in a multi-locale chapel implementation of CoMD

R Haque, D Richards - 2016 PGAS Applications Workshop …, 2016 - ieeexplore.ieee.org
Chapel supports distributed computing with an underlying PGAS memory address space.
While it provides abstractions for writing simple and elegant distributed code, the type …

A local-view array library for partitioned global address space C++ programs

A Kamil, Y Zheng, K Yelick - … on Libraries, Languages, and Compilers for …, 2014 - dl.acm.org
Multidimensional arrays are an important data structure in many scientific applications.
Unfortunately, built-in support for such arrays is inadequate in C++, particularly in the …

3‐D data partitioning for 3‐level perfectly nested loops on heterogeneous distributed systems

E Zarei Zefreh, S Lotfi… - Concurrency and …, 2017 - Wiley Online Library
Nested loops are the largest source of parallelism in many data‐parallel scientific
applications. Heterogeneous distributed systems are popular computing platforms for data …

Asynchronous nested parallelism for dynamic applications in distributed memory

I Papadopoulos, N Thomas, A Fidel, D Hoxha… - … and Compilers for …, 2016 - Springer
Nested parallelism is of increasing interest for both expressivity and performance. Many
problems are naturally expressed with this divide-and-conquer software design approach. In …

Integrating SkePU's algorithmic skeletons with GPI on a cluster

J Almqvist - 2022 - diva-portal.org
As processors' clock-speed flattened out in the early 2000s, multi-core processors became
more prevalent and so did parallel programming. However this programming paradigm …

Endpoint security in networks: An openmp approach for increasing malware detection speed

I Forain, R de Oliveira Albuquerque… - Symmetry, 2017 - mdpi.com
Increasingly sophisticated antivirus (AV) software and the growing amount and complexity of
malware demand more processing power from personal computers, specifically from the …