Decoupling representation learning and classification for gnn-based anomaly detection

Y Wang, J Zhang, S Guo, H Yin, C Li… - Proceedings of the 44th …, 2021 - dl.acm.org
GNN-based anomaly detection has recently attracted considerable attention. Existing
attempts have thus far focused on jointly learning the node representations and the classifier …

AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures

Z Zheng, X Yang, P Zhao, G Long, K Zhu… - Proceedings of the 27th …, 2022 - dl.acm.org
This work reveals that memory-intensive computation is a rising performance-critical factor in
recent machine learning models. Due to a unique set of new challenges, existing ML …

POCLib: A high-performance framework for enabling near orthogonal processing on compression

F Zhang, J Zhai, X Shen, O Mutlu… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Parallel technology boosts data processing in recent years, and parallel direct data
processing on hierarchically compressed documents exhibits great promise. The high …

A survey on agent-based simulation using hardware accelerators

J **ao, P Andelfinger, D Eckhoff, W Cai… - ACM Computing Surveys …, 2019 - dl.acm.org
Due to decelerating gains in single-core CPU performance, computationally expensive
simulations are increasingly executed on highly parallel hardware platforms. Agent-based …

A survey on techniques for cooperative CPU-GPU computing

K Raju, NN Chiplunkar - Sustainable Computing: Informatics and Systems, 2018 - Elsevier
Abstract Graphical Processing Unit provides massive parallelism due to the presence of
hundreds of cores. Usage of GPUs for general purpose computation (GPGPU) has resulted …

CompressDB: Enabling efficient compressed data direct processing for various databases

F Zhang, W Wan, C Zhang, J Zhai, Y Chai… - Proceedings of the 2022 …, 2022 - dl.acm.org
In modern data management systems, directly performing operations on compressed data
has been proven to be a big success facing big data problems. These systems have …

Clusterscl: Cluster-aware supervised contrastive learning on graphs

Y Wang, J Zhang, H Li, Y Dong, H Yin, C Li… - Proceedings of the ACM …, 2022 - dl.acm.org
We study the problem of supervised contrastive (SupCon) learning on graphs. The SupCon
loss has been recently proposed for classification tasks by pulling data points in the same …

Design of a quantization-based dnn delta compression framework for model snapshots and federated learning

H **, D Wu, S Zhang, X Zou, S **… - … on Parallel and …, 2023 - ieeexplore.ieee.org
Deep neural networks (DNNs) have achieved remarkable success in many fields. However,
large-scale DNNs also bring storage costs when storing snapshots for preventing clusters' …

Haspmv: Heterogeneity-aware sparse matrix-vector multiplication on modern asymmetric multicore processors

W Li, H Cheng, Z Lu, Y Lu, W Liu - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Sparse matrix-vector multiplication (SpMV) is a fundamental routine in computational
science and engineering. Its optimization methods on various homogeneous parallel …

Exploring data analytics without decompression on embedded GPU systems

Z Pan, F Zhang, Y Zhou, J Zhai, X Shen… - … on Parallel and …, 2021 - ieeexplore.ieee.org
With the development of computer architecture, even for embedded systems, GPU devices
can be integrated, providing outstanding performance and energy efficiency to meet the …