Optimal parallel algorithms in the binary-forking model

GE Blelloch, JT Fineman, Y Gu, Y Sun - … of the 32nd ACM Symposium on …, 2020 - dl.acm.org
In this paper we develop optimal algorithms in the binary-forking model for a variety of
fundamental problems, including sorting, semisorting, list ranking, tree contraction, range …

Randomized incremental convex hull is highly parallel

GE Blelloch, Y Gu, J Shun, Y Sun - … of the 32nd ACM Symposium on …, 2020 - dl.acm.org
The randomized incremental convex hull algorithm is one of the most practical and important
geometric algorithms in the literature. Due to its simplicity, and the fact that many points or …

Data oblivious algorithms for multicores

V Ramachandran, E Shi - Proceedings of the 33rd ACM Symposium on …, 2021 - dl.acm.org
A data-oblivious algorithm is an algorithm whose memory access pattern is independent of
the input values. We initiate the study of parallel data oblivious algorithms on realistic …

Analysis of work-stealing and parallel cache complexity

Y Gu, Z Napier, Y Sun - Symposium on Algorithmic Principles of Computer …, 2022 - SIAM
Parallelism has become extremely popular over the past decade, and there have been a lot
of new parallel algorithms and software. The randomized work-stealing (RWS) scheduler …

[CARTE][B] Shared-memory parallelism can be simple, fast, and scalable

J Shun - 2017 - books.google.com
Parallelism is the key to achieving high performance in computing. However, writing efficient
and scalable parallel programs is notoriously difficult, and often requires significant …

Flowpools: A lock-free deterministic concurrent dataflow abstraction

A Prokopec, H Miller, T Schlatter, P Haller… - … and Compilers for …, 2013 - Springer
Implementing correct and deterministic parallel programs is challenging. Even though
concurrency constructs exist in popular programming languages to facilitate the task of …

Teaching Parallel Algorithms Using the Binary-Forking Model

GE Blelloch, Y Gu, Y Sun - 2024 IEEE International Parallel and …, 2024 - ieeexplore.ieee.org
In this paper, we share our experience in teaching parallel algorithms with the binary-forking
model. With hardware advances, multicore computers are now ubiquitous. This has created …

Performance of Text-Independent Automatic Speaker Recognition on a Multicore System

R Kouatly, TA Khan - Tsinghua Science and Technology, 2023 - ieeexplore.ieee.org
This paper studies a high-speed text-independent Automatic Speaker Recognition (ASR)
algorithm based on a multicore system's Gaussian Mixture Model (GMM). The high speech …

[PDF][PDF] Write-efficient Algorithms

Y Gu - 2018 - reports-archive.adm.cs.cmu.edu
New non-volatile memory (NVM) technologies are projected to become the dominant type of
main memory in the near future. They promise byteaddressability, good read latencies, and …

A new method of MHP analysis for languages with dynamic barriers

S Joshi, RK Shyamasundar… - 2012 IEEE 26th …, 2012 - ieeexplore.ieee.org
May-happen-in-parallel analysis is a very important analysis which enables several
optimizations in parallel programs. Most of the work on MHP analysis has used forward flow …