Programming parallel algorithms

GE Blelloch - Communications of the ACM, 1996 - dl.acm.org
parallel algorithftls. Researchers have developed efficient parallel algorithms to solve most
problems for which efficient sequential solutions are known. Although some ofthese …

Eliminating array bound checking through dependent types

H **, F Pfenning - Proceedings of the ACM SIGPLAN 1998 conference …, 1998 - dl.acm.org
We present a type-based approach to eliminating array bound checking and list tag
checking by conservatively extending Standard ML with a restricted form of dependent …

CoreDet: A compiler and runtime system for deterministic multithreaded execution

T Bergan, O Anderson, J Devietti, L Ceze… - Proceedings of the …, 2010 - dl.acm.org
The behavior of a multithreaded program does not depend only on its inputs. Scheduling,
memory reordering, timing, and low-level hardware effects all introduce nondeterminism in …

Carbon: architectural support for fine-grained parallelism on chip multiprocessors

S Kumar, CJ Hughes, A Nguyen - ACM SIGARCH Computer Architecture …, 2007 - dl.acm.org
Chip multiprocessors (CMPs) are now commonplace, and the number of cores on a CMP is
likely to grow steadily. However, in order to harness the additional compute resources of a …

TIL: A type-directed optimizing compiler for ML

D Tarditi, G Morrisett, P Cheng, C Stone, R Harper… - ACM Sigplan …, 1996 - dl.acm.org
We are investigating a new approach to compiling Standard ML (SML) based on four key
technologies: interwional polymorphism [23], nearlg tag-free garbage collection[12, 46, 34] …

Getting to the point: index sets and parallelism-preserving autodiff for pointful array programming

A Paszke, DD Johnson, D Duvenaud… - Proceedings of the …, 2021 - dl.acm.org
We present a novel programming language design that attempts to combine the clarity and
safety of high-level functional languages with the efficiency and parallelism of low-level …

Provably efficient scheduling for languages with fine-grained parallelism

GE Blelloch, PB Gibbons, Y Matias - Journal of the ACM (JACM), 1999 - dl.acm.org
Many high-level parallel programming languages allow for fine-grained parallelism. As in
the popular work-time framework for parallel algorithm design, programs written in such …

Type-directed scheduling of streaming accelerators

D Durst, M Feldman, D Huff, D Akeley, R Daly… - Proceedings of the 41st …, 2020 - dl.acm.org
Designing efficient, application-specialized hardware accelerators requires assessing trade-
offs between a hardware module's performance and resource requirements. To facilitate …

[PDF][PDF] Deterministic Process Groups in {dOS}

T Bergan, N Hunt, L Ceze, SD Gribble - 9th USENIX Symposium on …, 2010 - usenix.org
Current multiprocessor systems execute parallel and concurrent software
nondeterministically: even when given precisely the same input, two executions of the same …

Implementation of a portable nested data-parallel language

GE Blelloch, JC Hardwick, S Chatterjee… - ACM Sigplan …, 1993 - dl.acm.org
This paper gives an overview of the implementation of NESL, a portable nested data-parallel
language. This language and its implementation are the first to fully support nested data …