Partitioned global address space languages

M De Wael, S Marr, B De Fraine… - ACM Computing …, 2015 - dl.acm.org
The Partitioned Global Address Space (PGAS) model is a parallel programming model that
aims to improve programmer productivity while at the same time aiming for high …

Habanero-Java: the new adventures of old X10

V Cavé, J Zhao, J Shirako, V Sarkar - Proceedings of the 9th …, 2011 - dl.acm.org
In this paper, we present the Habanero-Java (HJ) language developed at Rice University as
an extension to the original Java-based definition of the X10 language. HJ includes a …

Ocelot: a dynamic optimization framework for bulk-synchronous applications in heterogeneous systems

GF Diamos, AR Kerr, S Yalamanchili… - Proceedings of the 19th …, 2010 - dl.acm.org
Ocelot is a dynamic compilation framework designed to map the explicitly data parallel
execution model used by NVIDIA CUDA applications onto diverse multithreaded platforms …

Slaw: a scalable locality-aware adaptive work-stealing scheduler for multi-core systems

Y Guo, J Zhao, V Cave, V Sarkar - … of the 15th ACM SIGPLAN Symposium …, 2010 - dl.acm.org
This poster introduces SLAW, a Scalable Locality-aware Adaptive Work-stealing scheduler.
The SLAW features an adaptive task scheduling algorithm combined with a locality-aware …

Trends in data locality abstractions for HPC systems

D Unat, A Dubey, T Hoefler, J Shalf… - … on Parallel and …, 2017 - ieeexplore.ieee.org
The cost of data movement has always been an important concern in high performance
computing (HPC) systems. It has now become the dominant factor in terms of both energy …

Exascale computing trends: Adjusting to the" new normal"'for computer architecture

P Kogge, J Shalf - Computing in Science & Engineering, 2013 - ieeexplore.ieee.org
We now have 20 years of data under our belt about the performance of supercomputers
against at least a single floating-point benchmark from dense linear algebra. Until about …

Extreme heterogeneity 2018-productive computational science in the era of extreme heterogeneity: Report for DOE ASCR workshop on extreme heterogeneity

JS Vetter, R Brightwell, M Gokhale, P McCormick… - 2018 - osti.gov
The 2018 Basic Research Needs Workshop on Extreme Heterogeneity identified five Priority
Research Directions for realizing the capabilities needed to address the challenges posed …

[PDF][PDF] Hierarchical work stealing on manycore clusters

SJ Min, C Iancu, K Yelick - Fifth Conference on Partitioned Global Address …, 2011 - Citeseer
Abstract Partitioned Global Address Space languages like UPC offer a convenient way of
expressing large shared data structures, especially for irregular structures that require …

The locality descriptor: A holistic cross-layer abstraction to express data locality in GPUs

N Vijaykumar, E Ebrahimi, K Hsieh… - 2018 ACM/IEEE 45th …, 2018 - ieeexplore.ieee.org
Exploiting data locality in GPUs is critical to making more efficient use of the existing caches
and the NUMA-based memory hierarchy expected in future GPUs. While modern GPU …

Hpvm: Heterogeneous parallel virtual machine

M Kotsifakou, P Srivastava, MD Sinclair… - Proceedings of the 23rd …, 2018 - dl.acm.org
We propose a parallel program representation for heterogeneous systems, designed to
enable performance portability across a wide range of popular parallel hardware, including …