Dalorex: A data-local program execution and architecture for memory-bound applications

M Orenes-Vera, E Tureci, D Wentzlaff… - … Symposium on High …, 2023‏ - ieeexplore.ieee.org
Applications with low data reuse and frequent irregular memory accesses, such as graph or
sparse linear algebra workloads, fail to scale well due to memory bottlenecks and poor core …

Hardware architectures for public key cryptography

L Batina, SB Örs, B Preneel, J Vandewalle - Integration, 2003‏ - Elsevier
This paper presents an overview of hardware implementations for the two commonly used
types of public key cryptography, ie RSA and elliptic curve cryptography, both based on …

Chlorophyll: Synthesis-aided compiler for low-power spatial architectures

PM Phothilimthana, T Jelvis, R Shah, N Totla… - ACM SIGPLAN …, 2014‏ - dl.acm.org
We developed Chlorophyll, a synthesis-aided programming model and compiler for the
GreenArrays GA144, an extremely minimalist low-power spatial architecture that requires …

Defect tolerance on the teramac custom computer

WB Culbertson, R Amerson, RJ Carter… - … . The 5th Annual …, 1997‏ - ieeexplore.ieee.org
Teramac is a large custom computer which works correctly despite the fact that three
quarters of its FPGAs contain defects. This is accomplished through unprecedented use of …

Transmuter: Bridging the efficiency gap using memory and dataflow reconfiguration

S Pal, S Feng, D Park, S Kim, A Amarnath… - Proceedings of the …, 2020‏ - dl.acm.org
With the end of Dennard scaling and Moore's law, it is becoming increasingly difficult to build
hardware for emerging applications that meet power and performance targets, while …

Versa: A 36-core systolic multiprocessor with dynamically reconfigurable interconnect and memory

S Kim, M Fayazi, A Daftardar, KY Chen… - IEEE Journal of Solid …, 2022‏ - ieeexplore.ieee.org
We present Versa, an energy-efficient 36-core systolic multiprocessor with dynamically
reconfigurable interconnects and memory. Versa leverages reconfigurable functional units …

Real-time FPGA-based Kalman filter for constant and non-constant velocity periodic error correction

C Wang, ED Burnham-Fay, JD Ellis - Precision Engineering, 2017‏ - Elsevier
Displacement measuring interferometry has high resolution and high dynamic range, which
is widely used in displacement metrology and sensor calibration. Due to beam leakage in …

Codesign tradeoffs for high-performance, low-power linear algebra architectures

A Pedram, RA Van De Geijn… - IEEE Transactions on …, 2012‏ - ieeexplore.ieee.org
As technology is reaching physical limits, reducing power consumption is a key issue on our
path to sustained performance. In this paper, we study fundamental tradeoffs and limits in …

Dataflow-based parallelization of control-flow algorithms

N Korolija, J Popović, M Cvetanović, M Bojović - Advances in computers, 2017‏ - Elsevier
Compared to control-flow architectures, dataflow architectures usually offer better
performances in high performance computing. Moreover, dataflow architectures consume …

A multichannel and compact time to digital converter for time of flight positron emission tomography

N Marino, F Baronti, L Fanucci… - … on Nuclear Science, 2015‏ - ieeexplore.ieee.org
This paper presents a novel multichannel time to digital converter (TDC) specifically
designed for the digitization of photon time of flight (TOF) and energy in positron emission …