FPGA HLS today: successes, challenges, and opportunities
The year 2011 marked an important transition for FPGA high-level synthesis (HLS), as it
went from prototy** to deployment. A decade later, in this article, we assess the progress …
went from prototy** to deployment. A decade later, in this article, we assess the progress …
Pushing the level of abstraction of digital system design: A survey on how to program fpgas
Field Programmable Gate Arrays (FPGAs) are spatial architectures with a heterogeneous
reconfigurable fabric. They are state-of-the-art for prototy**, telecommunications …
reconfigurable fabric. They are state-of-the-art for prototy**, telecommunications …
Interstellar: Using halide's scheduling language to analyze dnn accelerators
We show that DNN accelerator micro-architectures and their program map**s represent
specific choices of loop order and hardware parallelism for computing the seven nested …
specific choices of loop order and hardware parallelism for computing the seven nested …
SODA: Stencil with optimized dataflow architecture
Stencil computation is one of the most important kernels in many application domains such
as image processing, solving partial differential equations, and cellular automata. Many of …
as image processing, solving partial differential equations, and cellular automata. Many of …
A compiler infrastructure for accelerator generators
We present Calyx, a new intermediate language (IL) for compiling high-level programs into
hardware designs. Calyx combines a hardware-like structural language with a software-like …
hardware designs. Calyx combines a hardware-like structural language with a software-like …
HeteroCL: A multi-paradigm programming infrastructure for software-defined reconfigurable computing
With the pursuit of improving compute performance under strict power constraints, there is
an increasing need for deploying applications to heterogeneous hardware architectures with …
an increasing need for deploying applications to heterogeneous hardware architectures with …
iPIM: Programmable in-memory image processing accelerator using near-bank architecture
Image processing is becoming an increasingly important domain for many applications on
workstations and the datacenter that require accelerators for high performance and energy …
workstations and the datacenter that require accelerators for high performance and energy …
Transformations of high-level synthesis codes for high-performance computing
Spatial computing architectures promise a major stride in performance and energy efficiency
over the traditional load/store devices currently employed in large scale computing systems …
over the traditional load/store devices currently employed in large scale computing systems …
OverGen: Improving FPGA usability through domain-specific overlay generation
FPGAs have been proven to be powerful computational accelerators across many types of
workloads. The mainstream programming approach is high level synthesis (HLS), which …
workloads. The mainstream programming approach is high level synthesis (HLS), which …
Predictable accelerator design with time-sensitive affine types
Field-programmable gate arrays (FPGAs) provide an opportunity to co-design applications
with hardware accelerators, yet they remain difficult to program. High-level synthesis (HLS) …
with hardware accelerators, yet they remain difficult to program. High-level synthesis (HLS) …