DGCL: An efficient communication library for distributed GNN training

Z Cai, X Yan, Y Wu, K Ma, J Cheng, F Yu - Proceedings of the Sixteenth …, 2021 - dl.acm.org
Graph neural networks (GNNs) have gained increasing popularity in many areas such as e-
commerce, social networks and bio-informatics. Distributed GNN training is essential for …

Polygraph: Exposing the value of flexibility for graph processing accelerators

V Dadu, S Liu, T Nowatzki - 2021 ACM/IEEE 48th Annual …, 2021 - ieeexplore.ieee.org
Because of the importance of graph workloads and the limitations of CPUs/GPUs, many
graph processing accelerators have been proposed. The basic approach of prior …

Subway: Minimizing data transfer during out-of-GPU-memory graph processing

AHN Sabet, Z Zhao, R Gupta - … of the Fifteenth European Conference on …, 2020 - dl.acm.org
In many graph-based applications, the graphs tend to grow, imposing a great challenge for
GPU-based graph processing. When the graph size exceeds the device memory capacity …

Emogi: Efficient memory-access for out-of-memory graph-traversal in gpus

SW Min, VS Mailthody, Z Qureshi, J **ong… - arxiv preprint arxiv …, 2020 - arxiv.org
Modern analytics and recommendation systems are increasingly based on graph data that
capture the relations between entities being analyzed. Practical graphs come in huge sizes …

G3 when graph neural networks meet parallel graph processing systems on GPUs

H Liu, S Lu, X Chen, B He - Proceedings of the VLDB Endowment, 2020 - dl.acm.org
This paper demonstrates G3, a framework for Graph Neural Network (GNN) training, tailored
from Graph processing systems on Graphics processing units (GPUs). G3 aims at improving …

Grus: Toward unified-memory-efficient high-performance graph processing on gpu

P Wang, J Wang, C Li, J Wang, H Zhu… - ACM Transactions on …, 2021 - dl.acm.org
Today's GPU graph processing frameworks face scalability and efficiency issues as the
graph size exceeds GPU-dedicated memory limit. Although recent GPUs can over-subscribe …

Depgraph: A dependency-driven accelerator for efficient iterative graph processing

Y Zhang, X Liao, H **, L He, B He… - … Symposium on High …, 2021 - ieeexplore.ieee.org
Many graph processing systems have been recently developed for many-core processors.
However, for iterative graph processing, due to the dependencies between vertices' states …

MG-Join: A scalable join for massively parallel multi-GPU architectures

J Paul, S Lu, B He, CT Lau - … of the 2021 International Conference on …, 2021 - dl.acm.org
The recent scale-up of GPU hardware through the integration of multiple GPUs into a single
machine and the introduction of higher bandwidth interconnects like NVLink 2.0 has …

Glign: Taming misaligned graph traversals in concurrent graph processing

X Yin, Z Zhao, R Gupta - Proceedings of the 28th ACM International …, 2022 - dl.acm.org
In concurrent graph processing, different queries are evaluated on the same graph
simultaneously, sharing the graph accesses via the memory hierarchy. However, different …

[HTML][HTML] Software systems implementation and domain-specific architectures towards graph analytics

H **, H Qi, J Zhao, X Jiang, Y Huang, C Gui… - Intelligent …, 2022 - spj.science.org
Graph analytics, which mainly includes graph processing, graph mining, and graph learning,
has become increasingly important in several domains, including social network analysis …