The future of FPGA acceleration in datacenters and the cloud

C Bobda, JM Mbongue, P Chow, M Ewais… - ACM Transactions on …, 2022 - dl.acm.org
In this article, we survey existing academic and commercial efforts to provide Field-
Programmable Gate Array (FPGA) acceleration in datacenters and the cloud. The goal is a …

OmpSs@ FPGA framework for high performance FPGA computing

JM De Haro, J Bosch, A Filgueras… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
This article presents the new features of the OmpSs@ FPGA framework. OmpSs is a data-
flow programming model that supports task nesting and dependencies to target …

Deploying multi-tenant FPGAs within Linux-based cloud infrastructure

JM Mbongue, DT Kwadjo, A Shu**… - ACM Transactions on …, 2021 - dl.acm.org
Cloud deployments now increasingly exploit Field-Programmable Gate Array (FPGA)
accelerators as part of virtual instances. While cloud FPGAs are still essentially single …

Xar-trek: run-time execution migration among FPGAs and heterogeneous-ISA CPUs

E Horta, HR Chuang, NR VSathish… - Proceedings of the …, 2021 - dl.acm.org
Datacenter servers are increasingly heterogeneous: from x86 host CPUs, to ARM or RISC-V
CPUs in NICs/SSDs, to FPGAs. Previous works have demonstrated that migrating …

Towards a component-based acceleration of convolutional neural networks on FPGAs

DT Kwadjo, EN Tchinda, JM Mbongue… - Journal of Parallel and …, 2022 - Elsevier
Abstract In recent years, Convolution Neural Networks (CNN) have been extensively
adopted in broad Artificial Intelligence (AI) applications and have demonstrated ability and …

An architectural template for FPGA overlays targeting data flow applications

A Drewes, V Burtsev, B Gurumurthy… - 2024 IEEE …, 2024 - ieeexplore.ieee.org
In this paper, we present a flexible and scalable architectural template for designing domain-
specific FPGA overlays. The template can be parametrized in terms of number and size of …

Performance exploration on pre-implemented CNN hardware accelerator on FPGA

DT Kwadjo, JM Mbongue… - … Conference on Field …, 2020 - ieeexplore.ieee.org
As the complexity of FPGA architectures increases, there is a raising need to improved
productivity and performance in several computing domains such as image processing …

Exploring a Layer-based Pre-implemented Flow for Map** CNN on FPGA

DT Kwadjo, JM Mbongue… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
Convolutional Neural Networks are compute-intensive learning models that have
demonstrated ability and effectiveness in solving complex learning problems. However …

Runtime Management of Dynamic Dataflows with Partially Reconfigurable Pipelines on FPGAs

K Mätas - 2023 - search.proquest.com
In order to overcome the famous von Neumann bottleneck, FPGAs employ a dataflow model
that processes data through a pipeline of operator modules, akin to an assembly line for …

Exploiting the Common Case When Accelerating Input-Dependent Stream Processing by FPGA

Z Zhao, J Melber, S Sahay, S Obla… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
FPGAs have traditionally been successful in accelerating stream processing applications
where the amount and type of work performed on each record—eg, image, packet—do not …