Zorua: A holistic approach to resource virtualization in GPUs
This paper introduces a new resource virtualization framework, Zorua, that decouples the
programmer-specified resource usage of a GPU application from the actual allocation in the …
Diplomat, an agent in a multi agent environment: An overview
In-place data manipulation is very desirable in many-core architectures with limited on-
board memory. This paper deals with the in-place implementation of a class of primitives that …
What is ahead for parallel computing
W Hwu - Journal of Parallel and Distributed Computing, 2014 - Elsevier
With the industry-wide switch to multicore and manycore architectures, parallel computing
has become the only venue in sight for continued growth in application performance. In …
Automatic data layout optimizations for GPUs
K Kofler, B Cosenza, T Fahringer - … , Vienna, Austria, August 24-28, 2015 …, 2015 - Springer
Memory optimizations have become increasingly important in order to fully exploit the
computational power of modern GPUs. The data arrangement has a big impact on the …
Automatic parallelization of kernels in shared-memory multi-GPU nodes
In this paper we present AMGE, a programming framework and runtime system that
transparently decomposes GPU kernels and executes them on multiple GPUs in parallel …
In-place data sliding algorithms for many-core architectures
In-place data manipulation is very desirable in many-core architectures with limited on-
board memory. This paper deals with the in-place implementation of a class of primitives that …
High-performance computation of Bézier surfaces on parallel and heterogeneous platforms
Bézier surfaces are mathematical tools employed in a wide variety of applications. Some
works in the literature propose parallelization strategies to improve performance for the …
Middleware power saving scheme for mobile applications
NZ Jhanjhi, FA Almusalli, SN Brohi… - … on Advances in …, 2018 - ieeexplore.ieee.org
Smartphone popularity, usage, and user dependency have increased over the years.
The popularity increase is linked with several factors such as smartphones size, ease in use …
Shonan challenge for generative programming: short position paper
B Aktemur, Y Kameyama, O Kiselyov… - Proceedings of the ACM …, 2013 - dl.acm.org
The appeal of generative programming is "abstraction without guilt": eliminating the vexing
trade-off between writing high-level code and highly-performant code. Generative …
Energy efficient middleware: Design and development for mobile applications
Over the recent years, the popularity of smartphones has increased dramatically. The
advanced integrated technology in smartphones like GPS, high-speed CPU, a real world …