L2ight: Enabling on-chip learning for optical neural networks via efficient in-situ subspace optimization

J Gu, H Zhu, C Feng, Z Jiang… - Advances in Neural …, 2021 - proceedings.neurips.cc
Silicon-photonics-based optical neural network (ONN) is a promising hardware platform that
could represent a paradigm shift in efficient AI with its CMOS-compatibility, flexibility, ultra …

Frequency-domain inference acceleration for convolutional neural networks using ReRAMs

B Liu, Z Jiang, Y Wu, J Wu, X Chen… - … on Parallel and …, 2023 - ieeexplore.ieee.org
Convolutional neural networks (CNNs)(including 2D and 3D convolutions) are popular in
video analysis tasks such as action recognition and activity understanding. Fast algorithms …

NDSEARCH: Accelerating graph-traversal-based approximate nearest neighbor search through near data processing

Y Wang, S Li, Q Zheng, L Song, Z Li… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Approximate nearest neighbor search (ANNS) is a key retrieval technique for vector
database and many data center applications, such as person re-identification and …

Hybrid Digital/Analog Memristor-based Computing Architecture for Sparse Deep Learning Acceleration

Q Zheng, S Li, Y Wang, Z Li… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Fine-grained sparsity in recent bio-inspired models such as attention-based model could
reduce the computation complexity dramatically. However, the unique sparsity pattern …

Towards memory-efficient neural networks via multi-level in situ generation

J Gu, H Zhu, C Feng, M Liu, Z Jiang… - Proceedings of the …, 2021 - openaccess.thecvf.com
Deep neural networks (DNN) have shown superior performance in a variety of tasks. As they
rapidly evolve, their escalating computation and memory demands make it challenging to …

NEON: Enabling Efficient Support for Nonlinear Operations in Resistive RAM-based Neural Network Accelerators

A Manglik, M Patel, H Mao, B Salami, J Park… - arxiv preprint arxiv …, 2022 - arxiv.org
Resistive Random-Access Memory (RRAM) is well-suited to accelerate neural network (NN)
workloads as RRAM-based Processing-in-Memory (PIM) architectures natively support …

Hybrid Digital/Analog In-Memory Computing

Q Zheng - 2024 - search.proquest.com
The relentless advancement of deep learning applications, particularly the highly potent yet
computationally intensive deep unsupervised learning models, is pushing the boundaries of …

[KNJIGA][B] Accelerator Architectures for Deep Learning and Graph Processing

L Song - 2020 - search.proquest.com
Deep learning and graph processing are two big-data applications and they are widely
applied in many domains. The training of deep learning is essential for inference and has …

Parallelism in deep learning accelerators

L Song, F Chen, Y Chen, H Li - 2020 25th Asia and South …, 2020 - ieeexplore.ieee.org
Deep learning is the core of artificial intelligence and it achieves state-of-the-art in a wide
range of applications. The intensity of computation and data in deep learning processing …