Cosa: Scheduling by constrained optimization for spatial accelerators

Q Huang, M Kang, G Dinh, T Norell… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
Recent advances in Deep Neural Networks (DNNs) have led to active development of
specialized DNN accelerators, many of which feature a large number of processing …

Modern development methods and tools for embedded reconfigurable systems: A survey

L Jóźwiak, N Nedjah, M Figueroa - Integration, 2010 - Elsevier
Heterogeneous reconfigurable systems provide drastically higher performance and lower
power consumption than traditional CPU-centric systems. Moreover, they do it at much lower …

[書籍][B] Design for embedded image processing on FPGAs

DG Bailey - 2023 - books.google.com
Design for Embedded Image Processing on FPGAs Bridge the gap between software and
hardware with this foundational design reference Field-programmable gate arrays (FPGAs) …

A decade of reconfigurable computing: a visionary retrospective

R Hartenstein - Proceedings design, automation and test in …, 2001 - ieeexplore.ieee.org
The paper surveys a decade of R&D on coarse grain reconfigurable hardware and related
CAD, points out why this emerging discipline is heading toward a dichotomy of computing …

Active pages: A computation model for intelligent memory

M Oskin, FT Chong, T Sherwood - Proceedings of the 25th annual …, 1998 - dl.acm.org
Microprocessors and memory systems suffer from a growing gap in performance. We
introduce Active Pages, a computation model which addresses this gap by shifting data …

Using machine learning to focus iterative optimization

F Agakov, E Bonilla, J Cavazos… - … Symposium on Code …, 2006 - ieeexplore.ieee.org
Iterative compiler optimization has been shown to outperform static approaches. This,
however, is at the cost of large numbers of evaluations of the program. This paper develops …

Simultaneous multithreading: A platform for next-generation processors

SJ Eggers, JS Emer, HM Levy, JL Lo, RL Stamm… - IEEE micro, 1997 - ieeexplore.ieee.org
Simultaneous multithreading is a processor design which consumes both thread-level and
instruction-level parallelism. In SMT processors, thread-level parallelism can come from …

Continuous profiling: Where have all the cycles gone?

JM Anderson, LM Berc, J Dean, S Ghemawat… - ACM Transactions on …, 1997 - dl.acm.org
This article describes the Digital Continuous Profiling Infrastructure, a sampling-based
profiling system designed to run continuously on production systems. The system supports …

[書籍][B] Modern compiler design

D Grune, K Van Reeuwijk, HE Bal, CJH Jacobs… - 2012 - books.google.com
" Modern Compiler Design" makes the topic of compiler design more accessible by focusing
on principles and techniques of wide application. By carefully distinguishing between the …

A single-chip multiprocessor

BA Nayfeh, K Olukotun - Computer, 1997 - ieeexplore.ieee.org
Presents the case for billion-transistor processor architectures that will consist of chip
multiprocessors (CMPs): multiple (four to 16) simple, fast processors on one chip. In their …