A comprehensive exploration of languages for parallel computing
Software-intensive systems in most domains, from autonomous vehicles to health, are
becoming predominantly parallel to efficiently manage large amount of data in short (even …
becoming predominantly parallel to efficiently manage large amount of data in short (even …
Efficient scheduling policies for {Microsecond-Scale} tasks
Datacenter operators today strive to support microsecond-latency applications while also
using their limited CPU resources as efficiently as possible. To achieve this, several recent …
using their limited CPU resources as efficiently as possible. To achieve this, several recent …
Openstream: Expressiveness and data-flow compilation of openmp streaming programs
We present OpenStream, a data-flow extension of OpenMP to express dynamic dependent
tasks. The language supports nested task creation, modular composition, variable and …
tasks. The language supports nested task creation, modular composition, variable and …
Tapir: Embedding fork-join parallelism into LLVM's intermediate representation
This paper explores how fork-join parallelism, as supported by concurrency platforms such
as Cilk and OpenMP, can be embedded into a compiler's intermediate representation (IR) …
as Cilk and OpenMP, can be embedded into a compiler's intermediate representation (IR) …
Runnemede: An architecture for ubiquitous high-performance computing
DARPA's Ubiquitous High-Performance Computing (UHPC) program asked researchers to
develop computing systems capable of achieving energy efficiencies of 50 GOPS/Watt …
develop computing systems capable of achieving energy efficiencies of 50 GOPS/Watt …
Scalable and precise dynamic datarace detection for structured parallelism
Existing dynamic race detectors suffer from at least one of the following three limitations:(i)
space overhead per memory location grows linearly with the number of parallel threads [13] …
space overhead per memory location grows linearly with the number of parallel threads [13] …
Realm: An event-based low-level runtime for distributed memory architectures
We present Realm, an event-based runtime system for heterogeneous, distributed memory
machines. Realm is fully asynchronous: all runtime actions are non-blocking. Realm …
machines. Realm is fully asynchronous: all runtime actions are non-blocking. Realm …
Integrating asynchronous task parallelism with MPI
Effective combination of inter-node and intra-node parallelism is recognized to be a major
challenge for future extreme-scale systems. Many researchers have demonstrated the …
challenge for future extreme-scale systems. Many researchers have demonstrated the …
Reliable and cooperative target tracking based on WSN and WiFi in indoor wireless networks
J Luo, Z Zhang, C Liu, H Luo - IEEE Access, 2018 - ieeexplore.ieee.org
Indoor localization and tracking have attracted growing attention because of its widely
application for indoor location-based services (LBSs). However, for indoor localization, only …
application for indoor location-based services (LBSs). However, for indoor localization, only …
Parallel Objects for Multicores: A Glimpse at the Parallel Language Encore
The age of multi-core computers is upon us, yet current programming languages, typically
designed for single-core computers and adapted post hoc for multi-cores, remain tied to the …
designed for single-core computers and adapted post hoc for multi-cores, remain tied to the …