TANGO: re-thinking quantization for graph neural network training on GPUs
Graph learning is becoming increasingly popular due to its superior performance in tackling
many grand challenges. While quantization is widely used to accelerate Graph Neural …
many grand challenges. While quantization is widely used to accelerate Graph Neural …
[HTML][HTML] Benchmarking Big Data Systems: Performance and Decision-Making Implications in Emerging Technologies
Systems for graph processing are a key enabler for insights from large-scale graphs that are
critical to many new advanced technologies such as Artificial Intelligence, Internet of Things …
critical to many new advanced technologies such as Artificial Intelligence, Internet of Things …
Heterogeneous Data-Centric Architectures for Modern Data-Intensive Applications: Case Studies in Machine Learning and Databases
Today's computing systems require moving data back-and-forth between computing
resources (eg, CPUs, GPUs, accelerators) and off-chip main memory so that computation …
resources (eg, CPUs, GPUs, accelerators) and off-chip main memory so that computation …
How to Fit the SCC Algorithm Efficiently into Distributed Graph Iterative Computation
X Sun, W Wang, T Huang - 2024 IEEE 48th Annual Computers …, 2024 - ieeexplore.ieee.org
This paper reviews the sequential, parallel and distributed implementations of strongly
connected component algorithms, and analyzes the challenges of each implementation in …
connected component algorithms, and analyzes the challenges of each implementation in …
High-Performance Domain-Specific Systems for Graph and Machine Learning Workloads
J Chen - 2024 - search.proquest.com
Graph-structure data is prevalent because of its ability to capture relations between real-
world entities. However, graph data analyzing applications, including traditional and …
world entities. However, graph data analyzing applications, including traditional and …
Accelerating Neural Network Training with Processing-in-Memory GPU
Processing-in-memory (PIM) architecture is promising for accelerating deep neural network
(DNN) training due to its low-latency and energy-efficient data movement between …
(DNN) training due to its low-latency and energy-efficient data movement between …