A survey of general‐purpose computation on graphics hardware

JD Owens, D Luebke, N Govindaraju… - Computer graphics …, 2007 - Wiley Online Library
The rapid increase in the performance of graphics hardware, coupled with recent
improvements in its programmability, have made graphics hardware a compelling platform …

The problem with threads

EA Lee - Computer, 2006 - ieeexplore.ieee.org
For concurrent programming to become mainstream, we must discard threads as a
programming model. Nondeterminism should be judiciously and carefully introduced where …

The tao of parallelism in algorithms

K **ali, D Nguyen, M Kulkarni, M Burtscher… - Proceedings of the …, 2011 - dl.acm.org
For more than thirty years, the parallel programming community has used the dependence
graph as the main abstraction for reasoning about and exploiting parallelism in" regular" …

Stateful dataflow multigraphs: A data-centric model for performance portability on heterogeneous architectures

T Ben-Nun, J de Fine Licht, AN Ziogas… - Proceedings of the …, 2019 - dl.acm.org
The ubiquity of accelerators in high-performance computing has driven programming
complexity beyond the skill-set of the average domain scientist. To maintain performance …

A survey of algorithmic skeleton frameworks: high‐level structured parallel programming enablers

H González‐Vélez, M Leyton - Software: Practice and …, 2010 - Wiley Online Library
Structured parallel programs ought to be conceived as two separate and complementary
entities: computation, which expresses the calculations in a procedural manner, and …

[PDF][PDF] The parallel BGL: A generic library for distributed graph computations

D Gregor, A Lumsdaine - Parallel Object-Oriented Scientific …, 2005 - researchgate.net
This paper presents the Parallel BGL, a generic C++ library for distributed graph
computation. Like the sequential Boost Graph Library (BGL) upon which it is based, the …

[PDF][PDF] Computing foundations and practice for cyber-physical systems: A preliminary report

EA Lee - University of California, Berkeley, Tech. Rep. UCB …, 2007 - eecs.berkeley.edu
Abstract Cyber-Physical Systems (CPS) are integrations of computation and physical
processes. Embedded computers and networks monitor and control the physical processes …

An asymmetric distributed shared memory model for heterogeneous parallel systems

I Gelado, JE Stone, J Cabezas, S Patel… - Proceedings of the …, 2010 - dl.acm.org
Heterogeneous computing combines general purpose CPUs with accelerators to efficiently
execute both sequential control-intensive and data-parallel phases of applications. Existing …

Concepts: Linguistic support for generic programming in C++

D Gregor, J Järvi, J Siek, B Stroustrup… - Proceedings of the 21st …, 2006 - dl.acm.org
Generic programming has emerged as an important technique for the development of highly
reusable and efficient software libraries. In C++, generic programming is enabled by the …

UPC++: A high-performance communication framework for asynchronous computation

J Bachan, SB Baden, S Hofmeyr… - 2019 IEEE …, 2019 - ieeexplore.ieee.org
UPC++ is a C++ library that supports high-performance computation via an asynchronous
communication framework. This paper describes a new incarnation that differs substantially …