Programming and synthesis for software-defined FPGA acceleration: status and future prospects
FPGA-based accelerators are increasingly popular across a broad range of applications,
because they offer massive parallelism, high energy efficiency, and great flexibility for …
because they offer massive parallelism, high energy efficiency, and great flexibility for …
ThunderGP: HLS-based graph processing framework on FPGAs
FPGA has been an emerging computing infrastructure in datacenters benefiting from
features of fine-grained parallelism, energy efficiency, and reconfigurability. Meanwhile …
features of fine-grained parallelism, energy efficiency, and reconfigurability. Meanwhile …
Towards a uniform template-based architecture for accelerating 2D and 3D CNNs on FPGA
J Shen, Y Huang, Z Wang, Y Qiao, M Wen… - Proceedings of the 2018 …, 2018 - dl.acm.org
Three-dimensional convolutional neural networks (3D CNNs) are used efficiently in many
computer vision applications. Most previous work in this area has concentrated only on …
computer vision applications. Most previous work in this area has concentrated only on …
TGPA: Tile-grained pipeline architecture for low latency CNN inference
FPGAs are more and more widely used as reconfigurable hardware accelerators for
applications leveraging convolutional neural networks (CNNs) in recent years. Previous …
applications leveraging convolutional neural networks (CNNs) in recent years. Previous …
FPGA sharing in the cloud: a comprehensive analysis
Cloud vendors are actively adopting FPGAs into their infrastructures for enhancing
performance and efficiency. As cloud services continue to evolve, FPGA (field …
performance and efficiency. As cloud services continue to evolve, FPGA (field …
Automated accelerator generation and optimization with composable, parallel and pipeline architecture
CPU-FPGA heterogeneous architectures feature flexible acceleration of many workloads to
advance computational capabilities and energy efficiency in today's datacenters. This …
advance computational capabilities and energy efficiency in today's datacenters. This …
FANS: FPGA-accelerated near-storage sorting
Large-scale sorting is always an important yet demanding task for data center applications.
In addition to powerful processing capability, high-performance sorting system requires …
In addition to powerful processing capability, high-performance sorting system requires …
COSMOS: Coordination of high-level synthesis and memory optimization for hardware accelerators
Hardware accelerators are key to the efficiency and performance of system-on-chip (SoC)
architectures. With high-level synthesis (HLS), designers can easily obtain several …
architectures. With high-level synthesis (HLS), designers can easily obtain several …
Demystifying the memory system of modern datacenter FPGAs for software programmers through microbenchmarking
With the public availability of FPGAs from major cloud service providers like AWS, Alibaba,
and Nimbix, hardware and software developers can now easily access FPGA platforms …
and Nimbix, hardware and software developers can now easily access FPGA platforms …
Automatic creation of high-bandwidth memory architectures from domain-specific languages: The case of computational fluid dynamics
Numerical simulations can help solve complex problems. Most of these algorithms are
massively parallel and thus good candidates for FPGA acceleration thanks to spatial …
massively parallel and thus good candidates for FPGA acceleration thanks to spatial …