Research progress on memristor: From synapses to computing systems

X Yang, B Taylor, A Wu, Y Chen… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
As the limits of transistor technology are approached, feature size in integrated circuit
transistors has been reduced very near to the minimum physically-realizable channel length …

Accpar: Tensor partitioning for heterogeneous deep learning accelerators

L Song, F Chen, Y Zhuo, X Qian, H Li… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
Deep neural network (DNN) accelerators as an example of domain-specific architecture
have demonstrated great success in DNN inference. However, the architecture acceleration …

A survey of hardware architectures for generative adversarial networks

N Shrivastava, MA Hanif, S Mittal, SR Sarangi… - Journal of Systems …, 2021 - Elsevier
Recent years have witnessed a significant interest in the “generative adversarial
networks”(GANs) due to their ability to generate high-fidelity data. Many models of GANs …

Tprune: Efficient transformer pruning for mobile devices

J Mao, H Yang, A Li, H Li, Y Chen - ACM Transactions on Cyber …, 2021 - dl.acm.org
The invention of Transformer model structure boosts the performance of Neural Machine
Translation (NMT) tasks to an unprecedented level. Many previous works have been done to …

Memristive GAN in analog

O Krestinskaya, B Choubey, AP James - Scientific reports, 2020 - nature.com
Abstract Generative Adversarial Network (GAN) requires extensive computing resources
making its implementation in edge devices with conventional microprocessor hardware a …

Inca: Input-stationary dataflow at outside-the-box thinking about deep learning accelerators

B Kim, S Li, H Li - 2023 IEEE International Symposium on High …, 2023 - ieeexplore.ieee.org
This paper first presents an input-stationary (IS) implemented crossbar accelerator (INCA),
supporting inference and training for deep neural networks (DNNs). Processing-in-memory …

PARC: A processing-in-CAM architecture for genomic long read pairwise alignment using ReRAM

F Chen, L Song, Y Chen - 2020 25th Asia and South Pacific …, 2020 - ieeexplore.ieee.org
Technological advances in long read sequences have greatly facilitated the development of
genomics. However, managing and analyzing the raw genomic data that outpaces Moore's …

Processing-in-memory technology for machine learning: From basic to asic

B Taylor, Q Zheng, Z Li, S Li… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Due to the need for computing models that can process large quantities of data efficiently
and with high throughput in many state-of-the-art machine learning algorithms, the …

Towards design methodology of efficient fast algorithms for accelerating generative adversarial networks on FPGAs

JW Chang, S Ahn, KW Kang… - 2020 25th Asia and South …, 2020 - ieeexplore.ieee.org
Generative adversarial networks (GANs) have shown excellent performance in image and
speech applications. GANs create impressive data primarily through a new type of operator …

ReBNN: in-situ acceleration of binarized neural networks in ReRAM using complementary resistive cell

L Song, Y Wu, X Qian, H Li, Y Chen - CCF Transactions on High …, 2019 - Springer
Resistive random access memory (ReRAM) has been proven capable to efficiently perform
in-situ matrix-vector computations in convolutional neural network (CNN) processing. The …