Research progress on memristor: From synapses to computing systems
As the limits of transistor technology are approached, feature size in integrated circuit
transistors has been reduced very near to the minimum physically-realizable channel length …
transistors has been reduced very near to the minimum physically-realizable channel length …
Accpar: Tensor partitioning for heterogeneous deep learning accelerators
Deep neural network (DNN) accelerators as an example of domain-specific architecture
have demonstrated great success in DNN inference. However, the architecture acceleration …
have demonstrated great success in DNN inference. However, the architecture acceleration …
A survey of hardware architectures for generative adversarial networks
Recent years have witnessed a significant interest in the “generative adversarial
networks”(GANs) due to their ability to generate high-fidelity data. Many models of GANs …
networks”(GANs) due to their ability to generate high-fidelity data. Many models of GANs …
Tprune: Efficient transformer pruning for mobile devices
The invention of Transformer model structure boosts the performance of Neural Machine
Translation (NMT) tasks to an unprecedented level. Many previous works have been done to …
Translation (NMT) tasks to an unprecedented level. Many previous works have been done to …
Memristive GAN in analog
Abstract Generative Adversarial Network (GAN) requires extensive computing resources
making its implementation in edge devices with conventional microprocessor hardware a …
making its implementation in edge devices with conventional microprocessor hardware a …
Inca: Input-stationary dataflow at outside-the-box thinking about deep learning accelerators
This paper first presents an input-stationary (IS) implemented crossbar accelerator (INCA),
supporting inference and training for deep neural networks (DNNs). Processing-in-memory …
supporting inference and training for deep neural networks (DNNs). Processing-in-memory …
PARC: A processing-in-CAM architecture for genomic long read pairwise alignment using ReRAM
Technological advances in long read sequences have greatly facilitated the development of
genomics. However, managing and analyzing the raw genomic data that outpaces Moore's …
genomics. However, managing and analyzing the raw genomic data that outpaces Moore's …
Processing-in-memory technology for machine learning: From basic to asic
Due to the need for computing models that can process large quantities of data efficiently
and with high throughput in many state-of-the-art machine learning algorithms, the …
and with high throughput in many state-of-the-art machine learning algorithms, the …
Towards design methodology of efficient fast algorithms for accelerating generative adversarial networks on FPGAs
Generative adversarial networks (GANs) have shown excellent performance in image and
speech applications. GANs create impressive data primarily through a new type of operator …
speech applications. GANs create impressive data primarily through a new type of operator …
ReBNN: in-situ acceleration of binarized neural networks in ReRAM using complementary resistive cell
Resistive random access memory (ReRAM) has been proven capable to efficiently perform
in-situ matrix-vector computations in convolutional neural network (CNN) processing. The …
in-situ matrix-vector computations in convolutional neural network (CNN) processing. The …