Hygcn: A gcn accelerator with hybrid architecture
Inspired by the great success of neural networks, graph convolutional neural networks
(GCNs) are proposed to analyze graph data. GCNs mainly include two phases with distinct …
(GCNs) are proposed to analyze graph data. GCNs mainly include two phases with distinct …
The tao of parallelism in algorithms
For more than thirty years, the parallel programming community has used the dependence
graph as the main abstraction for reasoning about and exploiting parallelism in" regular" …
graph as the main abstraction for reasoning about and exploiting parallelism in" regular" …
Alleviating irregularity in graph analytics acceleration: A hardware/software co-design approach
Graph analytics is an emerging application which extracts insights by processing large
volumes of highly connected data, namely graphs. The parallel processing of graphs has …
volumes of highly connected data, namely graphs. The parallel processing of graphs has …
Lonestar: A suite of parallel irregular programs
Until recently, parallel programming has largely focused on the exploitation of data-
parallelism in dense matrix programs. However, many important application domains …
parallelism in dense matrix programs. However, many important application domains …
A survey on thread-level speculation techniques
A Estebanez, DR Llanos… - ACM Computing Surveys …, 2016 - dl.acm.org
Thread-Level Speculation (TLS) is a promising technique that allows the parallel execution
of sequential code without relying on a prior, compile-time-dependence analysis. In this …
of sequential code without relying on a prior, compile-time-dependence analysis. In this …
Evaluating and improving the performance and scheduling of HPC applications in cloud
Cloud computing is emerging as a promising alternative to supercomputers for some high-
performance computing (HPC) applications. With cloud as an additional deployment option …
performance computing (HPC) applications. With cloud as an additional deployment option …
Kremlin: Rethinking and rebooting gprof for the multicore age
Many recent parallelization tools lower the barrier for parallelizing a program, but overlook
one of the first questions that a programmer needs to answer: which parts of the program …
one of the first questions that a programmer needs to answer: which parts of the program …
Morph algorithms on GPUs
There is growing interest in using GPUs to accelerate graph algorithms such as breadth-first
search, computing page-ranks, and finding shortest paths. However, these algorithms do not …
search, computing page-ranks, and finding shortest paths. However, these algorithms do not …
A survey of performance tuning techniques and tools for parallel applications
D Mustafa - IEEE Access, 2022 - ieeexplore.ieee.org
Automatic parallelization of sequential programs combined with auto-tuning is an alternative
to manual parallelization. With wider research directions and the increased number of …
to manual parallelization. With wider research directions and the increased number of …
[書籍][B] Introduction to high performance scientific computing
V Eijkhout - 2010 - books.google.com
Page 1 - ºf Eijkhou Page 2 Introduction to High Performance Scientific Computing Victor Eijkhout
with Edmond Chow, Robert van de Geijn 2nd edition, revision 2015 Page 3 Introduction to …
with Edmond Chow, Robert van de Geijn 2nd edition, revision 2015 Page 3 Introduction to …