CoNDA: Efficient cache coherence support for near-data accelerators

A Boroumand, S Ghose, M Patel, H Hassan… - Proceedings of the 46th …, 2019 - dl.acm.org
Specialized on-chip accelerators are widely used to improve the energy efficiency of
computing systems. Recent advances in memory technology have enabled near-data …

iPIM: Programmable in-memory image processing accelerator using near-bank architecture

P Gu, X **e, Y Ding, G Chen, W Zhang… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
Image processing is becoming an increasingly important domain for many applications on
workstations and the datacenter that require accelerators for high performance and energy …

Euphrates: Algorithm-soc co-design for low-power mobile continuous vision

Y Zhu, A Samajdar, M Mattina… - arxiv preprint arxiv …, 2018 - arxiv.org
Continuous computer vision (CV) tasks increasingly rely on convolutional neural networks
(CNN). However, CNNs have massive compute demands that far exceed the performance …

Mesorasi: Architecture support for point cloud analytics via delayed-aggregation

Y Feng, B Tian, T Xu, P Whatmough… - 2020 53rd Annual IEEE …, 2020 - ieeexplore.ieee.org
Point cloud analytics is poised to become a key workload on battery-powered embedded
and mobile platforms in a wide range of emerging application domains, such as …

Diffy: A Déjà vu-free differential deep neural network accelerator

M Mahmoud, K Siu, A Moshovos - 2018 51st Annual IEEE/ACM …, 2018 - ieeexplore.ieee.org
We show that Deep Convolutional Neural Network (CNN) implementations of computational
imaging tasks exhibit spatially correlated values. We exploit this correlation to reduce the …

ecnn: A block-based and highly-parallel cnn accelerator for edge inference

CT Huang, YC Ding, HC Wang, CW Weng… - Proceedings of the …, 2019 - dl.acm.org
Convolutional neural networks (CNNs) have recently demonstrated superior quality for
computational imaging applications. Therefore, they have great potential to revolutionize the …

Memristor-based light-weight transformer circuit implementation for speech recognizing

H **ao, Y Zhou, T Gao, S Duan… - IEEE Journal on …, 2023 - ieeexplore.ieee.org
Transformer network (TN) is a promising model widely used for natural language processing
(NLP), computer vision (CV), and audio processing (AP). However, the large number of …

Practical Mechanisms for Reducing Processor–Memory Data Movement in Modern Workloads

A Boroumand - 2020 - search.proquest.com
Data movement between the memory system and computation units is one of the most
critical challenges in designing high performance and energy-efficient computing systems …

Analog image denoising with an adaptive memristive crossbar network

O Krestinskaya, K Salama… - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Noise in image sensors led to the development of a whole range of denoising filters. A noisy
image can become hard to recognize and often require several types of post-processing …

ImaGen: A general framework for generating memory-and power-efficient image processing accelerators

N Ujjainkar, J Leng, Y Zhu - … of the 50th Annual International Symposium …, 2023 - dl.acm.org
Image processing algorithms are prime targets for hardware acceleration as they are
commonly used in resource-and power-limited applications. Today's image processing …