The evolution of distributed systems for graph neural networks and their origin in graph processing and deep learning: A survey

J Vatter, R Mayer, HA Jacobsen - ACM Computing Surveys, 2023 - dl.acm.org
Graph neural networks (GNNs) are an emerging research field. This specialized deep
neural network architecture is capable of processing graph structured data and bridges the …

[PDF][PDF] MallobSat:: Scalable SAT Solving by Clause Sharing

D Schreiber, P Sanders - Journal of Artificial Intelligence Research, 2024 - jair.org
SAT solving in large distributed environments has previously led to some famous results and
to impressive speedups for selected inputs. However, in terms of general-purpose SAT …

Distributed memory, GPU accelerated Fock construction for hybrid, Gaussian basis density functional theory

DB Williams-Young, A Asadchev… - The Journal of …, 2023 - pubs.aip.org
With the growing reliance of modern supercomputers on accelerator-based architecture
such a graphics processing units (GPUs), the development and optimization of electronic …

Engineering in-place (shared-memory) sorting algorithms

M Axtmann, S Witt, D Ferizovic, P Sanders - ACM Transactions on …, 2022 - dl.acm.org
We present new sequential and parallel sorting algorithms that now represent the fastest
known techniques for a wide range of input sizes, input distributions, data types, and …

High-quality shared-memory graph partitioning

Y Akhremtsev, P Sanders… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Partitioning graphs into blocks of roughly equal size such that few edges run between blocks
is a frequently needed operation in processing graphs. Recently, size, variety, and structural …

Efficient step** algorithms and implementations for parallel shortest paths

X Dong, Y Gu, Y Sun, Y Zhang - … of the 33rd ACM Symposium on …, 2021 - dl.acm.org
The single-source shortest-path (SSSP) problem is a notoriously hard problem in the
parallel context. In practice, the Δ-step** algorithm of Meyer and Sanders has been widely …

Parallel weighted random sampling

L Hübschle-Schneider, P Sanders - ACM Transactions on Mathematical …, 2022 - dl.acm.org
Data structures for efficient sampling from a set of weighted items are an important building
block of many applications. However, few parallel solutions are known. We close many of …

Beyond Throughput and Compression Ratios: Towards High End-to-end Utility of Gradient Compression

W Han, S Vargaftik, M Mitzenmacher, B Karp… - Proceedings of the 23rd …, 2024 - dl.acm.org
Gradient aggregation has long been identified as a major bottleneck in today's large-scale
distributed machine learning training systems. One promising solution to mitigate such …

Methodology of algorithm engineering

J Mendling, H Leopold, H Meyerhenke… - arxiv preprint arxiv …, 2023 - arxiv.org
Research on algorithms has drastically increased in recent years. Various sub-disciplines of
computer science investigate algorithms according to different objectives and standards …

Decentralized online scheduling of malleable NP-hard jobs

P Sanders, D Schreiber - European Conference on Parallel Processing, 2022 - Springer
In this work, we address an online job scheduling problem in a large distributed computing
environment. Each job has a priority and a demand of resources, takes an unknown amount …