TANGO: re-thinking quantization for graph neural network training on GPUs

S Chen, D Zheng, C Ding, C Huan, Y Ji… - Proceedings of the …, 2023 - dl.acm.org
Graph learning is becoming increasingly popular due to its superior performance in tackling
many grand challenges. While quantization is widely used to accelerate Graph Neural …

[HTML][HTML] Benchmarking Big Data Systems: Performance and Decision-Making Implications in Emerging Technologies

L Theodorakopoulos, A Karras, A Theodoropoulou… - Technologies, 2024 - mdpi.com
Systems for graph processing are a key enabler for insights from large-scale graphs that are
critical to many new advanced technologies such as Artificial Intelligence, Internet of Things …

Heterogeneous Data-Centric Architectures for Modern Data-Intensive Applications: Case Studies in Machine Learning and Databases

GF Oliveira, A Boroumand, S Ghose… - 2022 IEEE Computer …, 2022 - ieeexplore.ieee.org
Today's computing systems require moving data back-and-forth between computing
resources (eg, CPUs, GPUs, accelerators) and off-chip main memory so that computation …

How to Fit the SCC Algorithm Efficiently into Distributed Graph Iterative Computation

X Sun, W Wang, T Huang - 2024 IEEE 48th Annual Computers …, 2024 - ieeexplore.ieee.org
This paper reviews the sequential, parallel and distributed implementations of strongly
connected component algorithms, and analyzes the challenges of each implementation in …

High-Performance Domain-Specific Systems for Graph and Machine Learning Workloads

J Chen - 2024 - search.proquest.com
Graph-structure data is prevalent because of its ability to capture relations between real-
world entities. However, graph data analyzing applications, including traditional and …

Accelerating Neural Network Training with Processing-in-Memory GPU

X Fei, J Han, J Huang, W Zheng… - 2022 22nd IEEE …, 2022 - ieeexplore.ieee.org
Processing-in-memory (PIM) architecture is promising for accelerating deep neural network
(DNN) training due to its low-latency and energy-efficient data movement between …