Machine learning for microcontroller-class hardware: A review

SS Saha, SS Sandha, M Srivastava - IEEE Sensors Journal, 2022‏ - ieeexplore.ieee.org
The advancements in machine learning (ML) opened a new opportunity to bring intelligence
to the low-end Internet-of-Things (IoT) nodes, such as microcontrollers. Conventional ML …

FPGA-based accelerators of deep learning networks for learning and classification: A review

A Shawahna, SM Sait, A El-Maleh - ieee Access, 2018‏ - ieeexplore.ieee.org
Due to recent advances in digital technologies, and availability of credible data, an area of
artificial intelligence, deep learning, has emerged and has demonstrated its ability and …

{TVM}: An automated {End-to-End} optimizing compiler for deep learning

T Chen, T Moreau, Z Jiang, L Zheng, E Yan… - … USENIX Symposium on …, 2018‏ - usenix.org
There is an increasing need to bring machine learning to a wide diversity of hardware
devices. Current frameworks rely on vendor-specific operator libraries and optimize for a …

Randomized numerical linear algebra: Foundations and algorithms

PG Martinsson, JA Tropp - Acta Numerica, 2020‏ - cambridge.org
This survey describes probabilistic algorithms for linear algebraic computations, such as
factorizing matrices and solving linear systems. It focuses on techniques that have a proven …

Morpho and Rvcg–shape analysis in R: R-packages for geometric morphometrics, shape analysis and surface manipulations

S Schlager - Statistical shape and deformation analysis, 2017‏ - Elsevier
The mathematical/statistical software platform R has seen an immense increase in
popularity within the last decade. Its main advantages are its flexibility, a large repository of …

Learning to optimize tensor programs

T Chen, L Zheng, E Yan, Z Jiang… - Advances in …, 2018‏ - proceedings.neurips.cc
We introduce a learning-based framework to optimize tensor programs for deep learning
workloads. Efficient implementations of tensor operators, such as matrix multiplication and …

Tensor comprehensions: Framework-agnostic high-performance machine learning abstractions

N Vasilache, O Zinenko, T Theodoridis, P Goyal… - arxiv preprint arxiv …, 2018‏ - arxiv.org
Deep learning models with convolutional and recurrent networks are now ubiquitous and
analyze massive amounts of audio, image, video, text and graph data, with applications in …

Ansor: Generating {High-Performance} tensor programs for deep learning

L Zheng, C Jia, M Sun, Z Wu, CH Yu, A Haj-Ali… - … USENIX symposium on …, 2020‏ - usenix.org
High-performance tensor programs are crucial to guarantee efficient execution of deep
neural networks. However, obtaining performant tensor programs for different operators on …

Throughput-optimized OpenCL-based FPGA accelerator for large-scale convolutional neural networks

N Suda, V Chandra, G Dasika, A Mohanty… - Proceedings of the …, 2016‏ - dl.acm.org
Convolutional Neural Networks (CNNs) have gained popularity in many computer vision
applications such as image classification, face detection, and video analysis, because of …

OpenMx 2.0: Extended structural equation and statistical modeling

MC Neale, MD Hunter, JN Pritikin, M Zahery, TR Brick… - Psychometrika, 2016‏ - Springer
The new software package OpenMx 2.0 for structural equation and other statistical modeling
is introduced and its features are described. OpenMx is evolving in a modular direction and …