SnuCL: an OpenCL framework for heterogeneous CPU/GPU clusters

J Kim, S Seo, J Lee, J Nah, G Jo, J Lee - Proceedings of the 26th ACM …, 2012 - dl.acm.org
In this paper, we propose SnuCL, an OpenCL framework for heterogeneous CPU/GPU
clusters. We show that the original OpenCL semantics naturally fits to the heterogeneous …

Fluidic kernels: Cooperative execution of opencl programs on multiple heterogeneous devices

P Pandit, R Govindarajan - … IEEE/ACM International Symposium on Code …, 2014 - dl.acm.org
Programming heterogeneous computing systems with Graphics Processing Units (GPU) and
multi-core CPUs in them is complex and time-consuming. OpenCL has emerged as an …

Automatic OpenCL work-group size selection for multicore CPUs

S Seo, J Lee, G Jo, J Lee - Proceedings of the 22nd …, 2013 - ieeexplore.ieee.org
In this paper, we address the effect of the work-group size on the performance of OpenCL
kernels. We propose a profiling-based algorithm that finds a good work-group size, in terms …

Enabling SIMT execution model on homogeneous multi-core system

KC Chen, CH Chen - ACM Transactions on Architecture and Code …, 2018 - dl.acm.org
Single-instruction multiple-thread (SIMT) machine emerges as a primary computing device
in high-perfor-mance computing, since the SIMT execution paradigm can exploit data-level …

A case for fine-grain coherence specialization in heterogeneous systems

J Alsop, WT Na, MD Sinclair, S Grayson… - ACM Transactions on …, 2022 - dl.acm.org
Hardware specialization is becoming a key enabler of energy-efficient performance. Future
systems will be increasingly heterogeneous, integrating multiple specialized and …

The BlackParrot BedRock Cache Coherence System

M Wyse, D Petrisko, F Gilani, YM Chueh, P Gao… - arxiv preprint arxiv …, 2022 - arxiv.org
This paper presents BP-BedRock, the open-source cache coherence protocol and system
implemented within the BlackParrot 64-bit RISC-V multicore processor. BP-BedRock …

Tfluxscc: Exploiting performance on future many-core systems through data-flow

A Diavastos, G Stylianou… - 2015 23rd Euromicro …, 2015 - ieeexplore.ieee.org
The current trend in processor design is to increase the number of cores as to achieve a
desired performance. While having a large number of cores on a chip seems to be feasible …

Comparative study of parallel programming models for multicore computing

A Ali - 2013 - diva-portal.org
Shared memory multi-core processor technology has seen a drastic development with faster
and increasing number of processors per chip. This new architecture challenges computer …

[BOOK][B] Programming Frameworks for Improving the Productivity and Performance of Manycore Architectures

L Cheng - 2022 - search.proquest.com
Manycore architectures integrate hundreds of cores on a single chip by using simple cores
and simple memory systems usually based on software-managed scratchpad memories …

[PDF][PDF] Cloud computing

AS Braga, GM Silva, MC Barros - Instituto de Computação …, 2012 - ic.unicamp.br
Cloud computing surgiu recentemente como um novo paradigma para a indústria da
informática com relaçaoa disponibilidade e acesso a recursos através da Internet. Ela tem …