Tools for reduced precision computation: a survey

S Cherubin, G Agosta - ACM Computing Surveys (CSUR), 2020‏ - dl.acm.org
The use of reduced precision to improve performance metrics such as computation latency
and power consumption is a common practice in the embedded systems field. This practice …

Sirnn: A math library for secure rnn inference

D Rathee, M Rathee, RKK Goli, D Gupta… - … IEEE Symposium on …, 2021‏ - ieeexplore.ieee.org
Complex machine learning (ML) inference algorithms like recurrent neural networks (RNNs)
use standard functions from math libraries like exponentiation, sigmoid, tanh, and reciprocal …

Energy-efficient approximate multiplication for digital signal processing and classification applications

S Narayanamoorthy, HA Moghaddam… - IEEE transactions on …, 2014‏ - ieeexplore.ieee.org
The need to support various digital signal processing (DSP) and classification applications
on energy-constrained devices has steadily grown. Such applications often extensively …

Compiling KB-sized machine learning models to tiny IoT devices

S Gopinath, N Ghanathe, V Seshadri… - Proceedings of the 40th …, 2019‏ - dl.acm.org
Recent advances in machine learning (ML) have produced KiloByte-size models that can
directly run on constrained IoT devices. This approach avoids expensive communication …

An introduction to distributed smart cameras

B Rinner, W Wolf - Proceedings of the IEEE, 2008‏ - ieeexplore.ieee.org
Distributed smart cameras (DSCs) are real-time distributed embedded systems that perform
computer vision using multiple cameras. This new approach has emerged thanks to a …

Real-time video analysis on an embedded smart camera for traffic surveillance

M Bramberger, J Brunner, B Rinner… - … . RTAS 2004. 10th …, 2004‏ - ieeexplore.ieee.org
A smart camera combines video sensing, high-level video processing and communication
within a single embedded device. Such cameras are key components in novel surveillance …

A holistic approach to automatic mixed-precision code generation and tuning for affine programs

J Xu, G Song, B Zhou, F Li, J Hao, J Zhao - Proceedings of the 29th ACM …, 2024‏ - dl.acm.org
Reducing floating-point (FP) precision is used to trade the quality degradation of a numerical
program's output for performance, but this optimization coincides with type casting, whose …

[PDF][PDF] A Programmable Architecture for Real-Time Derivative Trading

S Tandon - Master's Thesis, University of Edinburgh, 2003‏ - olsendata.com
Derivatives are financial securities that are used to hedge business risks, caused by
changes in foreign exchange rates, interest rates or prices of goods. The algorithms in the …

Shiftry: RNN inference in 2kb of RAM

A Kumar, V Seshadri, R Sharma - Proceedings of the ACM on …, 2020‏ - dl.acm.org
Traditionally, IoT devices send collected sensor data to an intelligent cloud where machine
learning (ML) inference happens. However, this course is rapidly changing and there is a …

Application of symbolic computer algebra in high-level data-flow synthesis

A Peymandoust, G De Micheli - IEEE transactions on computer …, 2003‏ - ieeexplore.ieee.org
The growing market of multimedia applications has required the development of complex
application-specified integrated circuits with significant data-path portions. Unfortunately …