System design using Khan process networks: the Compaan/Laura approach

T Stefanov, C Zissulescu, A Turjan… - … Automation and Test …, 2004‏ - ieeexplore.ieee.org
New emerging embedded system platforms in the realm of high-throughput multimedia,
imaging, and signal processing will consist of multiple microprocessors and reconfigurable …

A quantitative analysis of the speedup factors of FPGAs over processors

Z Guo, W Najjar, F Vahid, K Vissers - … of the 2004 ACM/SIGDA 12th …, 2004‏ - dl.acm.org
The speedup over a microprocessor that can be achieved by implementing some programs
on an FPGA has been extensively reported. This paper presents an analysis, both …

High-level language abstraction for reconfigurable computing

WA Najjar, W Bohm, BA Draper, J Hammes… - Computer, 2003‏ - ieeexplore.ieee.org
RC systems typically consist of an array of configurable computing elements. The
computational granularity of these elements ranges from simple gates-as abstracted by …

Profiling tools for hardware/software partitioning of embedded applications

DC Suresh, WA Najjar, F Vahid, JR Villarreal… - Proceedings of the 2003 …, 2003‏ - dl.acm.org
Loops constitute the most executed segments of programs and therefore are the best
candidates for hardware software partitioning. We present a set of profiling tools that are …

Understanding the prototy** strategies of experienced designers

E Hilton, J Linsey, J Goodman - 2015 IEEE Frontiers in …, 2015‏ - ieeexplore.ieee.org
Engineering students need to learn highly effective processes for pursuing difficult design
problems that require highly innovative solutions. Few studies exist of highly successful …

Automatic Software Tailoring for Optimal Performance

JM Aragón-Jurado, JC de la Torre… - IEEE Transactions …, 2023‏ - ieeexplore.ieee.org
Efficient green software solutions require being aware of the characteristics of both the
software and the hardware where it is executed. Separately optimizing them leads to …

[PDF][PDF] DART: a functional-level reconfigurable architecture for high energy efficiency

S Pillement, O Sentieys, R David - EURASIP Journal on Embedded …, 2007‏ - Springer
Flexibility becomes a major concern for the development of multimedia and mobile
communication systems, as well as classical high-performance and low-energy …

Tiny instruction caches for low power embedded systems

A Gordon-Ross, S Cotterell, F Vahid - ACM Transactions on Embedded …, 2003‏ - dl.acm.org
Instruction caches have traditionally been used to improve software performance. Recently,
several tiny instruction cache designs, including filter caches and dynamic loop caches …

Synthesis of customized loop caches for core-based embedded systems

S Cotterell, F Vahid - Proceedings of the 2002 IEEE/ACM international …, 2002‏ - dl.acm.org
Embedded system programs tend to spend much time in small loops. Introducing a very
small loop cache into the instruction memory hierarchy has thus been shown to substantially …

OpenCL Performance on the Intel Heterogeneous Architecture Research Platform

S Harris, RD Chamberlain, C Gill - 2020 IEEE High …, 2020‏ - ieeexplore.ieee.org
The fundamental operation of matrix multiplication is ubiquitous across a myriad of
disciplines. Yet, the identification of new optimizations for matrix multiplication remains …