Exploitation of GPUs for the parallelisation of probably parallel legacy code
General purpose GPUs provide massive compute power, but are notoriously difficult to
program. In this paper we present a complete compilation strategy to exploit GPUs for the …
Fast loop-level data dependence profiling
H Yu, Z Li - Proceedings of the 26th ACM international conference …, 2012 - dl.acm.org
Execution-driven data dependence profiling has gained significant interest as a tool to
compensate the weakness of static data dependence analysis. Although such dependence …
Multi-slicing: a compiler-supported parallel approach to data dependence profiling
H Yu, Z Li - Proceedings of the 2012 International Symposium on …, 2012 - dl.acm.org
Retrofitting existing software for the increasingly dominant multicore microprocessors has a
strong appeal from the economic point of view. One of the key issues in such an effort is to …
Quantifying the potential task-based dataflow parallelism in MPI applications
Task-based parallel programming languages require the programmer to partition the
traditional sequential code into smaller tasks in order to take advantage of the existing …
Polymorphic attacks against sequence-based software birthmarks
Sequence alignment algorithms have recently found a use in detecting code clones,
software plagiarism, code theft, and polymorphic malware. This approach involves extracting …
Free the Conqueror! Refactoring divide-and-conquer functions
Divide-and-conquer algorithms appear in the solution of many computationally intensive
problems, and are good candidates for parallelization. A divide-and-conquer computation …
Refactorings to enable parallelization
We propose program analyses to identify parallelizable code fragments, and program
transformations to change those fragments into applications of high-level parallel patterns …
Tareador: a tool to unveil parallelization strategies at undergraduate level
This paper presents a methodology and framework designed to assist students in the
process of finding appropriate task decomposition strategies for their sequential program, as …
Patty: A pattern-based parallelization tool for the multicore age
The free lunch of ever increasing clock frequencies is over. Performance-critical sequential
software must be parallelized, and this is tedious, hard, buggy, knowledge-intensive, and …
Automatic exploration of potential parallelism in sequential applications
The multicore era has increased the need for highly parallel software. Since automatic
parallelization turned out ineffective for many production codes, the community hopes for the …