Accelerating CNN inference on ASICs: A survey
Convolutional neural networks (CNNs) have proven to be a disruptive technology in most
vision, speech and image processing tasks. Given their ubiquitous acceptance, the research …
vision, speech and image processing tasks. Given their ubiquitous acceptance, the research …
Introduction to spin wave computing
This paper provides a tutorial overview over recent vigorous efforts to develop computing
systems based on spin waves instead of charges and voltages. Spin-wave computing can …
systems based on spin waves instead of charges and voltages. Spin-wave computing can …
[LIVRE][B] Handbook of signal processing systems
In this new edition of the Handbook of Signal Processing Systems, many of the chapters
from the previous editions have been updated, and several new chapters have been added …
from the previous editions have been updated, and several new chapters have been added …
Coarse-grained reconfigurable array architectures
Abstract Coarse-Grained Reconfigurable Array (CGRA) architectures accelerate the same
inner loops that benefit from the high instruction-level parallelism (ILP) support in very long …
inner loops that benefit from the high instruction-level parallelism (ILP) support in very long …
A high-performance multiply-accumulate unit by integrating additions and accumulations into partial product reduction process
CW Tung, SH Huang - Ieee Access, 2020 - ieeexplore.ieee.org
In this paper, we propose a low-power high-speed pipeline multiply-accumulate (MAC)
architecture. In a conventional MAC, carry propagations of additions (including additions in …
architecture. In a conventional MAC, carry propagations of additions (including additions in …
[LIVRE][B] Top-down digital VLSI design: from architectures to gate-level circuits and FPGAs
H Kaeslin - 2014 - books.google.com
Top-Down VLSI Design: From Architectures to Gate-Level Circuits and FPGAs represents a
unique approach to learning digital design. Developed from more than 20 years teaching …
unique approach to learning digital design. Developed from more than 20 years teaching …
Architectures for real-time volume rendering
H Pfister - Future generation computer systems, 1999 - Elsevier
Over the last decade, volume rendering has become an invaluable visualization technique
for a wide variety of applications. This paper reviews three special-purpose architectures for …
for a wide variety of applications. This paper reviews three special-purpose architectures for …
Improving the performance of hyperspectral image and signal processing algorithms using parallel, distributed and specialized hardware-based systems
Advances in sensor technology are revolutionizing the way remotely sensed data is
collected, managed and analyzed. The incorporation of latest-generation sensors to …
collected, managed and analyzed. The incorporation of latest-generation sensors to …
Parallelization in co-compilation for configurable accelerators-a host/accelerator partitioning compilation method
The paper introduces a novel co-compiler and its" vertical" parallelization method, including
a general model for co-operating host/accelerator platforms and a new parallelizing …
a general model for co-operating host/accelerator platforms and a new parallelizing …
Scenario-driven dynamic analysis for comprehending large software systems
Understanding large software systems is simplified when a combination of techniques for
static and dynamic analysis is employed. Effective dynamic analysis requires that execution …
static and dynamic analysis is employed. Effective dynamic analysis requires that execution …