Sisa: Set-centric instruction set architecture for graph mining on processing-in-memory systems

M Besta, R Kanakagiri, G Kwasniewski… - MICRO-54: 54th Annual …, 2021 - dl.acm.org
Simple graph algorithms such as PageRank have been the target of numerous hardware
accelerators. Yet, there also exist much more complex graph mining algorithms for problems …

DAMOV: A new methodology and benchmark suite for evaluating data movement bottlenecks

GF Oliveira, J Gómez-Luna, L Orosa, S Ghose… - IEEE …, 2021 - ieeexplore.ieee.org
Data movement between the CPU and main memory is a first-order obstacle against improv
ing performance, scalability, and energy efficiency in modern systems. Computer systems …

Graphq: Scalable pim-based graph processing

Y Zhuo, C Wang, M Zhang, R Wang, D Niu… - Proceedings of the …, 2019 - dl.acm.org
Processing-In-Memory (PIM) architectures based on recent technology advances (eg,
Hybrid Memory Cube) demonstrate great potential for graph processing. However, existing …

SpaceA: Sparse matrix vector multiplication on processing-in-memory accelerator

X **e, Z Liang, P Gu, A Basak, L Deng… - … Symposium on High …, 2021 - ieeexplore.ieee.org
Sparse matrix-vector multiplication (SpMV) is an important primitive across a wide range of
application domains such as scientific computing and graph analytics. Due to its intrinsic …

Prodigy: Improving the memory latency of data-indirect irregular workloads using hardware-software co-design

N Talati, K May, A Behroozi, Y Yang… - … Symposium on High …, 2021 - ieeexplore.ieee.org
Irregular workloads are typically bottlenecked by the memory system. These workloads often
use sparse data representations, eg, compressed sparse row/column (CSR/CSC), to …

Pangolin: An efficient and flexible graph mining system on cpu and gpu

X Chen, R Dathathri, G Gill, K **ali - Proceedings of the VLDB …, 2020 - dl.acm.org
There is growing interest in graph pattern mining (GPM) problems such as motif counting.
GPM systems have been developed to provide unified interfaces for programming …

Alleviating irregularity in graph analytics acceleration: A hardware/software co-design approach

M Yan, X Hu, S Li, A Basak, H Li, X Ma… - Proceedings of the …, 2019 - dl.acm.org
Graph analytics is an emerging application which extracts insights by processing large
volumes of highly connected data, namely graphs. The parallel processing of graphs has …

GraphA: An efficient ReRAM-based architecture to accelerate large scale graph processing

SA Ghasemi, B Jahannia, H Farbeh - Journal of Systems Architecture, 2022 - Elsevier
Graph analytics is the basis for many modern applications, eg, machine learning and
streaming data problems. With an unprecedented increase in data size of many emerging …

Optimizing cpu performance for recommendation systems at-scale

R Jain, S Cheng, V Kalagi, V Sanghavi, S Kaul… - Proceedings of the 50th …, 2023 - dl.acm.org
Deep Learning Recommendation Models (DLRMs) are very popular in personalized
recommendation systems and are a major contributor to the data-center AI cycles. Due to the …

Characterizing and understanding GCNs on GPU

M Yan, Z Chen, L Deng, X Ye, Z Zhang… - IEEE Computer …, 2020 - ieeexplore.ieee.org
Graph convolutional neural networks (GCNs) have achieved state-of-the-art performance on
graph-structured data analysis. Like traditional neural networks, training and inference of …