Photonic matrix multiplication lights up photonic accelerator and beyond

H Zhou, J Dong, J Cheng, W Dong, C Huang… - Light: Science & …, 2022 - nature.com
Matrix computation, as a fundamental building block of information processing in science
and technology, contributes most of the computational overheads in modern signal …

Prospects and applications of photonic neural networks

C Huang, VJ Sorger, M Miscuglio… - … in Physics: X, 2022 - Taylor & Francis
Neural networks have enabled applications in artificial intelligence through machine
learning, and neuromorphic computing. Software implementations of neural networks on …

Higher-dimensional processing using a photonic tensor core with continuous-time data

B Dong, S Aggarwal, W Zhou, UE Ali, N Farmakidis… - Nature …, 2023 - nature.com
New developments in hardware-based 'accelerators' range from electronic tensor cores and
memristor-based arrays to photonic implementations. The goal of these approaches is to …

A survey of design and optimization for systolic array-based dnn accelerators

R Xu, S Ma, Y Guo, D Li - ACM Computing Surveys, 2023 - dl.acm.org
In recent years, it has been witnessed that the systolic array is a successful architecture for
DNN hardware accelerators. However, the design of systolic arrays also encountered many …

Massively parallel amplitude-only Fourier neural network

M Miscuglio, Z Hu, S Li, JK George, R Capanna… - Optica, 2020 - opg.optica.org
Machine intelligence has become a driving factor in modern society. However, its demand
outpaces the underlying electronic technology due to limitations given by fundamental …

Large-scale optical neural networks based on photoelectric multiplication

R Hamerly, L Bernstein, A Sludds, M Soljačić… - Physical Review X, 2019 - APS
Recent success in deep neural networks has generated strong interest in hardware
accelerators to improve speed and energy consumption. This paper presents a new type of …

Digital electronics and analog photonics for convolutional neural networks (DEAP-CNNs)

V Bangari, BA Marquez, H Miller, AN Tait… - IEEE journal of …, 2019 - ieeexplore.ieee.org
Convolutional Neural Networks (CNNs) are powerful and highly ubiquitous tools for
extracting features from large datasets for applications such as computer vision and natural …

Towards unified int8 training for convolutional neural network

F Zhu, R Gong, F Yu, X Liu, Y Wang… - Proceedings of the …, 2020 - openaccess.thecvf.com
Abstract Recently low-bit (eg, 8-bit) network quantization has been extensively studied to
accelerate the inference. Besides inference, low-bit training with quantized gradients can …

Welder: Scheduling deep learning memory access via tile-graph

Y Shi, Z Yang, J Xue, L Ma, Y **a, Z Miao… - … USENIX Symposium on …, 2023 - usenix.org
With the growing demand for processing higher fidelity data and the use of faster computing
cores in newer hardware accelerators, modern deep neural networks (DNNs) are becoming …

A survey of techniques for optimizing deep learning on GPUs

S Mittal, S Vaishay - Journal of Systems Architecture, 2019 - Elsevier
The rise of deep-learning (DL) has been fuelled by the improvements in accelerators. Due to
its unique features, the GPU continues to remain the most widely used accelerator for DL …