Scalable superconductor neuron with ternary synaptic connections for ultra-fast SNN hardware

MA Karamuftuoglu, BZ Ucpinar, A Fayyazi… - Superconductor …, 2025 - iopscience.iop.org
A novel high-fan-in differential superconductor neuron structure designed for ultra-high-
performance spiking neural network (SNN) accelerators is presented. Utilizing a high-fan-in …

Accelerating deep learning model inference on Arm CPUs with ultra-low bit quantization and runtime

S Ashfaq, MH AskariHemmat, S Sah, E Saboori… - arXiv preprint arXiv …, 2022 - arxiv.org
Deep Learning has been one of the most disruptive technological advancements in recent
times. The high performance of deep learning models comes at the expense of high …

4.6-bit quantization for fast and accurate neural network inference on CPUs

A Trusov, E Limonova, D Nikolaev, VV Arlazarov - Mathematics, 2024 - mdpi.com
Quantization is a widespread method for reducing the inference time of neural networks on
mobile Central Processing Units (CPUs). Eight-bit quantized networks demonstrate similarly …

A method of using RSVD in residual calculation of LowBit GEMM

H Gu - arXiv preprint arXiv:2409.18772, 2024 - arxiv.org
The advancements of hardware technology in recent years have brought many possibilities
for low-precision applications. However, the use of low precision can introduce significant …

The promise of training deep neural networks on CPUs: A survey

W He - Journal of Physics: Conference Series, 2023 - iopscience.iop.org
This survey presents a comprehensive analysis of the potential benefits and challenges of
training deep neural networks (DNNs) on CPUs, summarizing existing research in the field …

Efficient hardware acceleration of deep neural networks via arithmetic complexity reduction

E Reggiani - 2023 - upcommons.upc.edu
Over the past decade, significant progress in the field of artificial intelligence
has led to remarkable advancements in a wide range of technologies. Deep learning, a …

Ultra Low Bit Quantization And Neural Networks

MS Ashfaq, MHA HEMMAT, SAH Sudhakar… - US Patent App. 18 …, 2024 - Google Patents
A system, method, and computer readable medium for deploying neural networks in low bit
environments. The system comprises a runtime platform, a first set of configuration …