From OpenCL to high-performance hardware on FPGAs

TS Czajkowski, U Aydonat, D Denisenko… - … conference on field …, 2012 - ieeexplore.ieee.org
We present an OpenCL compilation framework to generate high-performance hardware for
FPGAs. For an OpenCL application comprising a host program and a set of kernels, it …

FPGA programming for the masses

DF Bacon, R Rabbah, S Shukla - Communications of the ACM, 2013 - dl.acm.org
FPGA programming for the masses Page 1 56 communicAtionS of the Acm | APrIl 2013 | vOl. 56
| nO. 4 practice wHen looking at how hardware influences computing performance, we have …

pocl: A performance-portable OpenCL implementation

P Jääskeläinen, CS de La Lama, E Schnetter… - International Journal of …, 2015 - Springer
OpenCL is a standard for parallel programming of heterogeneous systems. The benefits of a
common programming standard are clear; multiple vendors can provide support for …

Improving performance of OpenCL on CPUs

R Karrenberg, S Hack - … : 21st International Conference, CC 2012, Held as …, 2012 - Springer
Data-parallel languages like OpenCL and CUDA are an important means to exploit the
computational power of today's computing devices. In this paper, we deal with two aspects of …

HW/SW co-design toolset for customization of exposed datapath processors

P Jääskeläinen, T Viitanen, J Takala, H Berg - Computing platforms for …, 2017 - Springer
Customized processors are an interesting option for implementing software defined radios;
they bring benefits of tailored fixed function hardware while adding new advantages such as …

Next generation distributed computing for cancer research

P Agarwal, K Owzar - Cancer informatics, 2014 - journals.sagepub.com
Advances in next generation sequencing (NGS) and mass spectrometry (MS) technologies
have provided many new opportunities and angles for extending the scope of translational …

Implementation techniques for spmd kernels on cpus

J Meyer, A Alpay, S Hack, H Fröning… - Proceedings of the 2023 …, 2023 - dl.acm.org
More and more frameworks and simulations are developed using heterogeneous
programming models such as OpenCL, SYCL, CUDA, or HIP. A significant hurdle to …

Applicability of gpgpu computing to real-time ai solutions in games

W Blewitt, G Ushaw, G Morgan - IEEE Transactions on …, 2013 - ieeexplore.ieee.org
This paper reviews developments in general purpose computing on graphics processor
units (GPGPU computing) from the perspective of video-game-related artificial intelligence …

Lordcore: Energy-efficient opencl-programmable software-defined radio coprocessor

H Kultala, T Viitanen, H Berg… - … Transactions on Very …, 2019 - ieeexplore.ieee.org
This paper proposes a single instruction multiple data (SIMD) processor, which is
programmed with high-level OpenCL language. The low-power processor is customized for …

Evaluation of real-time LBP computing in multiple architectures

M Bordallo López, A Nieto, J Boutellier… - Journal of Real-Time …, 2017 - Springer
Local binary pattern (LBP) is a texture operator that is used in several different computer
vision applications requiring, in many cases, real-time operation in multiple computing …