L2ight: Enabling on-chip learning for optical neural networks via efficient in-situ subspace optimization
Silicon-photonics-based optical neural network (ONN) is a promising hardware platform that
could represent a paradigm shift in efficient AI with its CMOS-compatibility, flexibility, ultra …
could represent a paradigm shift in efficient AI with its CMOS-compatibility, flexibility, ultra …
Frequency-domain inference acceleration for convolutional neural networks using ReRAMs
Convolutional neural networks (CNNs)(including 2D and 3D convolutions) are popular in
video analysis tasks such as action recognition and activity understanding. Fast algorithms …
video analysis tasks such as action recognition and activity understanding. Fast algorithms …
NDSEARCH: Accelerating graph-traversal-based approximate nearest neighbor search through near data processing
Approximate nearest neighbor search (ANNS) is a key retrieval technique for vector
database and many data center applications, such as person re-identification and …
database and many data center applications, such as person re-identification and …
Hybrid Digital/Analog Memristor-based Computing Architecture for Sparse Deep Learning Acceleration
Fine-grained sparsity in recent bio-inspired models such as attention-based model could
reduce the computation complexity dramatically. However, the unique sparsity pattern …
reduce the computation complexity dramatically. However, the unique sparsity pattern …
Towards memory-efficient neural networks via multi-level in situ generation
Deep neural networks (DNN) have shown superior performance in a variety of tasks. As they
rapidly evolve, their escalating computation and memory demands make it challenging to …
rapidly evolve, their escalating computation and memory demands make it challenging to …
NEON: Enabling Efficient Support for Nonlinear Operations in Resistive RAM-based Neural Network Accelerators
Resistive Random-Access Memory (RRAM) is well-suited to accelerate neural network (NN)
workloads as RRAM-based Processing-in-Memory (PIM) architectures natively support …
workloads as RRAM-based Processing-in-Memory (PIM) architectures natively support …
Hybrid Digital/Analog In-Memory Computing
Q Zheng - 2024 - search.proquest.com
The relentless advancement of deep learning applications, particularly the highly potent yet
computationally intensive deep unsupervised learning models, is pushing the boundaries of …
computationally intensive deep unsupervised learning models, is pushing the boundaries of …
[KNJIGA][B] Accelerator Architectures for Deep Learning and Graph Processing
L Song - 2020 - search.proquest.com
Deep learning and graph processing are two big-data applications and they are widely
applied in many domains. The training of deep learning is essential for inference and has …
applied in many domains. The training of deep learning is essential for inference and has …
Parallelism in deep learning accelerators
Deep learning is the core of artificial intelligence and it achieves state-of-the-art in a wide
range of applications. The intensity of computation and data in deep learning processing …
range of applications. The intensity of computation and data in deep learning processing …