Sisa: Set-centric instruction set architecture for graph mining on processing-in-memory systems
Simple graph algorithms such as PageRank have been the target of numerous hardware
accelerators. Yet, there also exist much more complex graph mining algorithms for problems …
accelerators. Yet, there also exist much more complex graph mining algorithms for problems …
DAMOV: A new methodology and benchmark suite for evaluating data movement bottlenecks
Data movement between the CPU and main memory is a first-order obstacle against improv
ing performance, scalability, and energy efficiency in modern systems. Computer systems …
ing performance, scalability, and energy efficiency in modern systems. Computer systems …
Graphq: Scalable pim-based graph processing
Processing-In-Memory (PIM) architectures based on recent technology advances (eg,
Hybrid Memory Cube) demonstrate great potential for graph processing. However, existing …
Hybrid Memory Cube) demonstrate great potential for graph processing. However, existing …
SpaceA: Sparse matrix vector multiplication on processing-in-memory accelerator
Sparse matrix-vector multiplication (SpMV) is an important primitive across a wide range of
application domains such as scientific computing and graph analytics. Due to its intrinsic …
application domains such as scientific computing and graph analytics. Due to its intrinsic …
Prodigy: Improving the memory latency of data-indirect irregular workloads using hardware-software co-design
Irregular workloads are typically bottlenecked by the memory system. These workloads often
use sparse data representations, eg, compressed sparse row/column (CSR/CSC), to …
use sparse data representations, eg, compressed sparse row/column (CSR/CSC), to …
Pangolin: An efficient and flexible graph mining system on cpu and gpu
There is growing interest in graph pattern mining (GPM) problems such as motif counting.
GPM systems have been developed to provide unified interfaces for programming …
GPM systems have been developed to provide unified interfaces for programming …
Alleviating irregularity in graph analytics acceleration: A hardware/software co-design approach
Graph analytics is an emerging application which extracts insights by processing large
volumes of highly connected data, namely graphs. The parallel processing of graphs has …
volumes of highly connected data, namely graphs. The parallel processing of graphs has …
GraphA: An efficient ReRAM-based architecture to accelerate large scale graph processing
Graph analytics is the basis for many modern applications, eg, machine learning and
streaming data problems. With an unprecedented increase in data size of many emerging …
streaming data problems. With an unprecedented increase in data size of many emerging …
Optimizing cpu performance for recommendation systems at-scale
Deep Learning Recommendation Models (DLRMs) are very popular in personalized
recommendation systems and are a major contributor to the data-center AI cycles. Due to the …
recommendation systems and are a major contributor to the data-center AI cycles. Due to the …
Characterizing and understanding GCNs on GPU
Graph convolutional neural networks (GCNs) have achieved state-of-the-art performance on
graph-structured data analysis. Like traditional neural networks, training and inference of …
graph-structured data analysis. Like traditional neural networks, training and inference of …