GPUpIO: The case for I/O-driven preemption on GPUs

L Zeno, A Mendelson, M Silberstein - … of the 9th Annual Workshop on …, 2016 - dl.acm.org
As GPUs become general purpose, they are outgrowing the coprocessor model and require
convenient I/O abstractions such as files and network sockets. Recent studies have shown …

Real-time multiview data fusion for object tracking with RGBD sensors

A Amamra, N Aouf - Robotica, 2016 - cambridge.org
This paper presents a new approach to accurately track a moving vehicle with a multiview
setup of red–green–blue depth (RGBD) cameras. We first propose a correction method to …

OpenMC: Towards simplifying programming for TianHe supercomputers

XK Liao, CQ Yung, T Tang, HZ Yi, F Wang… - Journal of Computer …, 2014 - Springer
Modern petascale and future exascale systems are massively heterogeneous architectures.
Developing productive intra-node programming models is crucial toward addressing their …

On expressing strategies for directive-driven multicore programing models

R Nobre, P Pinto, T Carvalho, JMP Cardoso… - … of Workshop on …, 2014 - dl.acm.org
A common migration path for applications to high-performance multicore architectures relies
on code annotations with concurrent semantics. Some annotations, however, are very target …

Efficient target and application specific selection and ordering of compiler passes

RJF Nobre - 2017 - search.proquest.com
Programmers usually rely on one from a set of optimizing compiler optimization level flags
shipped with the compiler they are using to compile their source code. Those compiler flags …

Tools for improving performance portability in heterogeneous environments

JF Fabeiro, DA CANOSA - 2017 - core.ac.uk
Parallel computing is currently partially dominated by the availability of heterogeneous
devices. These devices differ from each other in aspects such as the instruction set they …

Accelerating interpreted programming languages on GPUs with just-in-time compilation and runtime optimisations

F Alfonso, J José - 2017 - era.ed.ac.uk
Nowadays, most computer systems are equipped with powerful parallel devices such as
Graphics Processing Units (GPUs). They are present in almost every computer system …

Tools for improving performance portability in heterogeneous environments

J Fernández-Fabeiro - 2017 - ruc.udc.es
Parallel computing is currently partially dominated by the availability of heterogeneous
devices. These devices differ from each other in aspects such as the instruction set they …

Paulo Roberto Pereira de Souza Filho

A de Alto Desempenho - 2016 - maxwell.vrac.puc-rio.br
Rocque. Auxílio a Portabilidade de Código em Aplicações de Alto Desempenho [Aiding Code Portability in High-Performance Applications]. Rio de
Janeiro, 2016. 117 pp. Master's dissertation, Departamento de Informática, Pontifícia …

Accelerating range-based loops on heterogeneous systems

C Suwancharoen, W Marurngsith - 2015 7th International …, 2015 - ieeexplore.ieee.org
Range-based loop is a powerful construct due to its clear and concise syntax. The
abstraction of loop index in a range-based loop implies loop-level parallelism ready to be …