A survey on federated learning for resource-constrained IoT devices

A Imteaj, U Thakker, S Wang, J Li… - IEEE Internet of Things …, 2021 - ieeexplore.ieee.org
Federated learning (FL) is a distributed machine learning strategy that generates a global
model by learning from multiple decentralized edge clients. FL enables on-device training …

Compute-in-memory chips for deep learning: Recent trends and prospects

S Yu, H Jiang, S Huang, X Peng… - IEEE circuits and systems …, 2021 - ieeexplore.ieee.org
Compute-in-memory (CIM) is a new computing paradigm that addresses the memory-wall
problem in hardware accelerator design for deep learning. The input vector and weight …

Pruning and quantization for deep neural network acceleration: A survey

T Liang, J Glossner, L Wang, S Shi, X Zhang - Neurocomputing, 2021 - Elsevier
Deep neural networks have been applied in many applications exhibiting extraordinary
abilities in the field of computer vision. However, complex network architectures challenge …

Neuromorphic spintronics

J Grollier, D Querlioz, KY Camsari… - Nature …, 2020 - nature.com
Neuromorphic computing uses brain-inspired principles to design circuits that can perform
computational tasks with superior power efficiency to conventional computers. Approaches …

Lut-gemm: Quantized matrix multiplication based on luts for efficient inference in large-scale generative language models

G Park, B Park, M Kim, S Lee, J Kim, B Kwon… - arxiv preprint arxiv …, 2022 - arxiv.org
Recent advances in self-supervised learning and the Transformer architecture have
significantly improved natural language processing (NLP), achieving remarkably low …

Fast-scnn: Fast semantic segmentation network

RPK Poudel, S Liwicki, R Cipolla - arxiv preprint arxiv:1902.04502, 2019 - arxiv.org
The encoder-decoder framework is state-of-the-art for offline semantic image segmentation.
Since the rise in autonomous systems, real-time computation is increasingly desirable. In …

Training transformers with 4-bit integers

H **, C Li, J Chen, J Zhu - Advances in Neural Information …, 2023 - proceedings.neurips.cc
Quantizing the activation, weight, and gradient to 4-bit is promising to accelerate neural
network training. However, existing 4-bit training methods require custom numerical formats …

Deephunter: a coverage-guided fuzz testing framework for deep neural networks

X **e, L Ma, F Juefei-Xu, M Xue, H Chen, Y Liu… - Proceedings of the 28th …, 2019 - dl.acm.org
The past decade has seen the great potential of applying deep neural network (DNN) based
software to safety-critical scenarios, such as autonomous driving. Similar to traditional …