Partitioned global address space languages

M De Wael, S Marr, B De Fraine… - ACM Computing …, 2015 - dl.acm.org
The Partitioned Global Address Space (PGAS) model is a parallel programming model that
aims to improve programmer productivity while at the same time aiming for high …

{FaRM}: Fast remote memory

A Dragojević, D Narayanan, M Castro… - 11th USENIX Symposium …, 2014 - usenix.org
We describe the design and implementation of FaRM, a new main memory distributed
computing platform that exploits RDMA to improve both latency and throughput by an order …

[책][B] Parallel programming

T Rauber, G Rünger - 2013 - Springer
Innovations in hardware architecture, such as hyper-threading or multicore processors,
make parallel computing resources available for computer systems in different areas …

Evaluating mapreduce for multi-core and multiprocessor systems

C Ranger, R Raghuraman, A Penmetsa… - 2007 IEEE 13th …, 2007 - ieeexplore.ieee.org
This paper evaluates the suitability of the MapReduce model for multi-core and multi-
processor systems. MapReduce was created by Google for application development on data …

Transactional memory: An overview

T Harris, A Cristal, OS Unsal, E Ayguade… - IEEE micro, 2007 - ieeexplore.ieee.org
Writing applications that benefit from the massive computational power of future multicore
chip multiprocessors will not be an easy task for mainstream programmers accustomed to …

[책][B] Semantics engineering with PLT Redex

M Felleisen, RB Findler, M Flatt - 2009 - books.google.com
The first comprehensive presentation of reduction semantics in one volume, and the first tool
set for such forms of semantics. This text is the first comprehensive presentation of reduction …

Sequoia: Programming the memory hierarchy

K Fatahalian, DR Horn, TJ Knight, L Leem… - Proceedings of the …, 2006 - dl.acm.org
We present Sequoia, a programming language designed to facilitate the development of
memory hierarchy aware parallel programs that remain portable across modern machines …

Efficient distributed memory management with RDMA and caching

Q Cai, W Guo, H Zhang, D Agrawal, G Chen… - Proceedings of the …, 2018 - dl.acm.org
Recent advancements in high-performance networking interconnect significantly narrow the
performance gap between intra-node and inter-node communications, and open up …

Physis: an implicitly parallel programming model for stencil computations on large-scale GPU-accelerated supercomputers

N Maruyama, T Nomura, K Sato… - Proceedings of 2011 …, 2011 - dl.acm.org
This paper proposes a compiler-based programming framework that automatically translates
user-written structured grid code into scalable parallel implementation code for GPU …

Merge: a programming model for heterogeneous multi-core systems

MD Linderman, JD Collins, H Wang… - ACM SIGOPS operating …, 2008 - dl.acm.org
In this paper we propose the Merge framework, a general purpose programming model for
heterogeneous multi-core systems. The Merge framework replaces current ad hoc …