A comprehensive exploration of languages for parallel computing

F Ciccozzi, L Addazi, SA Asadollah, B Lisper… - ACM Computing …, 2022 - dl.acm.org
Software-intensive systems in most domains, from autonomous vehicles to health, are
becoming predominantly parallel to efficiently manage large amount of data in short (even …

Efficient scheduling policies for {Microsecond-Scale} tasks

S McClure, A Ousterhout, S Shenker… - … USENIX Symposium on …, 2022 - usenix.org
Datacenter operators today strive to support microsecond-latency applications while also
using their limited CPU resources as efficiently as possible. To achieve this, several recent …

Openstream: Expressiveness and data-flow compilation of openmp streaming programs

A Pop, A Cohen - ACM Transactions on Architecture and Code …, 2013 - dl.acm.org
We present OpenStream, a data-flow extension of OpenMP to express dynamic dependent
tasks. The language supports nested task creation, modular composition, variable and …

Tapir: Embedding fork-join parallelism into LLVM's intermediate representation

TB Schardl, WS Moses, CE Leiserson - Proceedings of the 22nd ACM …, 2017 - dl.acm.org
This paper explores how fork-join parallelism, as supported by concurrency platforms such
as Cilk and OpenMP, can be embedded into a compiler's intermediate representation (IR) …

Runnemede: An architecture for ubiquitous high-performance computing

NP Carter, A Agrawal, S Borkar… - 2013 IEEE 19th …, 2013 - ieeexplore.ieee.org
DARPA's Ubiquitous High-Performance Computing (UHPC) program asked researchers to
develop computing systems capable of achieving energy efficiencies of 50 GOPS/Watt …

Scalable and precise dynamic datarace detection for structured parallelism

R Raman, J Zhao, V Sarkar, M Vechev, E Yahav - Acm Sigplan Notices, 2012 - dl.acm.org
Existing dynamic race detectors suffer from at least one of the following three limitations:(i)
space overhead per memory location grows linearly with the number of parallel threads [13] …

Realm: An event-based low-level runtime for distributed memory architectures

S Treichler, M Bauer, A Aiken - … of the 23rd international conference on …, 2014 - dl.acm.org
We present Realm, an event-based runtime system for heterogeneous, distributed memory
machines. Realm is fully asynchronous: all runtime actions are non-blocking. Realm …

Integrating asynchronous task parallelism with MPI

S Chatterjee, S Tasırlar, Z Budimlic… - 2013 IEEE 27th …, 2013 - ieeexplore.ieee.org
Effective combination of inter-node and intra-node parallelism is recognized to be a major
challenge for future extreme-scale systems. Many researchers have demonstrated the …

Reliable and cooperative target tracking based on WSN and WiFi in indoor wireless networks

J Luo, Z Zhang, C Liu, H Luo - IEEE Access, 2018 - ieeexplore.ieee.org
Indoor localization and tracking have attracted growing attention because of its widely
application for indoor location-based services (LBSs). However, for indoor localization, only …

Parallel Objects for Multicores: A Glimpse at the Parallel Language Encore

S Brandauer, E Castegren, D Clarke… - Formal Methods for …, 2015 - Springer
The age of multi-core computers is upon us, yet current programming languages, typically
designed for single-core computers and adapted post hoc for multi-cores, remain tied to the …