Partitioned global address space languages
The Partitioned Global Address Space (PGAS) model is a parallel programming model that
aims to improve programmer productivity while at the same time aiming for high …
aims to improve programmer productivity while at the same time aiming for high …
{FaRM}: Fast remote memory
We describe the design and implementation of FaRM, a new main memory distributed
computing platform that exploits RDMA to improve both latency and throughput by an order …
computing platform that exploits RDMA to improve both latency and throughput by an order …
[책][B] Parallel programming
T Rauber, G Rünger - 2013 - Springer
Innovations in hardware architecture, such as hyper-threading or multicore processors,
make parallel computing resources available for computer systems in different areas …
make parallel computing resources available for computer systems in different areas …
Evaluating mapreduce for multi-core and multiprocessor systems
C Ranger, R Raghuraman, A Penmetsa… - 2007 IEEE 13th …, 2007 - ieeexplore.ieee.org
This paper evaluates the suitability of the MapReduce model for multi-core and multi-
processor systems. MapReduce was created by Google for application development on data …
processor systems. MapReduce was created by Google for application development on data …
Transactional memory: An overview
Writing applications that benefit from the massive computational power of future multicore
chip multiprocessors will not be an easy task for mainstream programmers accustomed to …
chip multiprocessors will not be an easy task for mainstream programmers accustomed to …
[책][B] Semantics engineering with PLT Redex
The first comprehensive presentation of reduction semantics in one volume, and the first tool
set for such forms of semantics. This text is the first comprehensive presentation of reduction …
set for such forms of semantics. This text is the first comprehensive presentation of reduction …
Sequoia: Programming the memory hierarchy
K Fatahalian, DR Horn, TJ Knight, L Leem… - Proceedings of the …, 2006 - dl.acm.org
We present Sequoia, a programming language designed to facilitate the development of
memory hierarchy aware parallel programs that remain portable across modern machines …
memory hierarchy aware parallel programs that remain portable across modern machines …
Efficient distributed memory management with RDMA and caching
Recent advancements in high-performance networking interconnect significantly narrow the
performance gap between intra-node and inter-node communications, and open up …
performance gap between intra-node and inter-node communications, and open up …
Physis: an implicitly parallel programming model for stencil computations on large-scale GPU-accelerated supercomputers
N Maruyama, T Nomura, K Sato… - Proceedings of 2011 …, 2011 - dl.acm.org
This paper proposes a compiler-based programming framework that automatically translates
user-written structured grid code into scalable parallel implementation code for GPU …
user-written structured grid code into scalable parallel implementation code for GPU …
Merge: a programming model for heterogeneous multi-core systems
In this paper we propose the Merge framework, a general purpose programming model for
heterogeneous multi-core systems. The Merge framework replaces current ad hoc …
heterogeneous multi-core systems. The Merge framework replaces current ad hoc …