HyGCN: A GCN accelerator with hybrid architecture

M Yan, L Deng, X Hu, L Liang, Y Feng… - … Symposium on High …, 2020 - ieeexplore.ieee.org
Inspired by the great success of neural networks, graph convolutional neural networks
(GCNs) are proposed to analyze graph data. GCNs mainly include two phases with distinct …

The tao of parallelism in algorithms

K Pingali, D Nguyen, M Kulkarni, M Burtscher… - Proceedings of the …, 2011 - dl.acm.org
For more than thirty years, the parallel programming community has used the dependence
graph as the main abstraction for reasoning about and exploiting parallelism in "regular" …

Alleviating irregularity in graph analytics acceleration: A hardware/software co-design approach

M Yan, X Hu, S Li, A Basak, H Li, X Ma… - Proceedings of the …, 2019 - dl.acm.org
Graph analytics is an emerging application which extracts insights by processing large
volumes of highly connected data, namely graphs. The parallel processing of graphs has …

Lonestar: A suite of parallel irregular programs

M Kulkarni, M Burtscher, C Caşcaval… - … Analysis of Systems …, 2009 - ieeexplore.ieee.org
Until recently, parallel programming has largely focused on the exploitation of data-
parallelism in dense matrix programs. However, many important application domains …

A survey on thread-level speculation techniques

A Estebanez, DR Llanos… - ACM Computing Surveys …, 2016 - dl.acm.org
Thread-Level Speculation (TLS) is a promising technique that allows the parallel execution
of sequential code without relying on a prior, compile-time-dependence analysis. In this …

Evaluating and improving the performance and scheduling of HPC applications in cloud

A Gupta, P Faraboschi, F Gioachin… - … on Cloud Computing, 2014 - ieeexplore.ieee.org
Cloud computing is emerging as a promising alternative to supercomputers for some high-
performance computing (HPC) applications. With cloud as an additional deployment option …

Kremlin: Rethinking and rebooting gprof for the multicore age

S Garcia, D Jeon, CM Louie, MB Taylor - ACM SIGPLAN Notices, 2011 - dl.acm.org
Many recent parallelization tools lower the barrier for parallelizing a program, but overlook
one of the first questions that a programmer needs to answer: which parts of the program …

Morph algorithms on GPUs

R Nasre, M Burtscher, K Pingali - Proceedings of the 18th ACM SIGPLAN …, 2013 - dl.acm.org
There is growing interest in using GPUs to accelerate graph algorithms such as breadth-first
search, computing page-ranks, and finding shortest paths. However, these algorithms do not …

A survey of performance tuning techniques and tools for parallel applications

D Mustafa - IEEE Access, 2022 - ieeexplore.ieee.org
Automatic parallelization of sequential programs combined with auto-tuning is an alternative
to manual parallelization. With wider research directions and the increased number of …

[BOOK][B] Introduction to high performance scientific computing

V Eijkhout - 2010 - books.google.com
Introduction to High Performance Scientific Computing. Victor Eijkhout
with Edmond Chow, Robert van de Geijn. 2nd edition, revision 2015 …