Techniques for combining fast local decoders with global decoders under circuit-level noise

C Chamberland, L Goncalves… - Quantum Science …, 2023 - iopscience.iop.org
Implementing algorithms on a fault-tolerant quantum computer will require fast decoding
throughput and latency times to prevent an exponential increase in buffer times between the …

O3BNN-R: An out-of-order architecture for high-performance and regularized BNN inference

T Geng, A Li, T Wang, C Wu, Y Li, R Shi… - … on parallel and …, 2020 - ieeexplore.ieee.org
Binarized Neural Networks (BNN), which significantly reduce computational complexity and
memory demand, have shown potential in cost-and power-restricted domains, such as IoT …

Accelerating binarized neural networks via bit-tensor-cores in turing gpus

A Li, S Su - IEEE Transactions on Parallel and Distributed …, 2020 - ieeexplore.ieee.org
Despite foreseeing tremendous speedups over conventional deep neural networks, the
performance advantage of binarized neural networks (BNNs) has merely been showcased …

FPDeep: Scalable acceleration of CNN training on deeply-pipelined FPGA clusters

T Wang, T Geng, A Li, X **… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Deep convolutional Neural Networks (CNNs) have revolutionized numerous applications,
but the demand for ever more performance remains unabated. Scaling CNN computations to …

BSTC: A novel binarized-soft-tensor-core design for accelerating bit-based approximated neural nets

A Li, T Geng, T Wang, M Herbordt, SL Song… - Proceedings of the …, 2019 - dl.acm.org
Binarized neural networks (or BNNs) promise tremendous performance improvement over
traditional DNNs through simplified bit-level computation and significantly reduced memory …

Facial expression recognition based on active region of interest using deep learning and parallelism

MA Hossain, B Assiri - PeerJ Computer Science, 2022 - peerj.com
The automatic facial expression tracking method has become an emergent topic during the
last few decades. It is a challenging problem that impacts many fields such as virtual reality …

Extending data flow architectures for convolutional neural networks to multiple fpgas

M Ibrahim, Z Zhao, M Hall, V Betz - … International Conference on …, 2023 - ieeexplore.ieee.org
Convolutional Neural Networks (CNNs) are the state-of-the-art Artificial Intelligence
frameworks that are used for a variety of applications including autonomous driving …

GAAF: searching activation functions for binary neural networks through genetic algorithm

Y Li, T Geng, S Stein, A Li, H Yu - Tsinghua Science and …, 2022 - ieeexplore.ieee.org
Binary neural networks (BNNs) show promising utilization in cost and power-restricted
domains such as edge devices and mobile systems. This is due to its significantly less …

A survey: Handling irregularities in neural network acceleration with fpgas

T Geng, C Wu, C Tan, C **e, A Guo… - 2021 IEEE High …, 2021 - ieeexplore.ieee.org
In the last decade, Artificial Intelligence (AI) through Deep Neural Networks (DNNs) has
penetrated virtually every aspect of science, technology, and business. Many types of DNNs …

η-lstm: Co-designing highly-efficient large lstm training via exploiting memory-saving and architectural design opportunities

X Zhang, H **a, D Zhuang, H Sun, X Fu… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
Recently, the recurrent neural network, or its most popular type—the Long Short Term
Memory (LSTM) network—has achieved great success in a broad spectrum of real-world …