CoNDA: Efficient cache coherence support for near-data accelerators
Specialized on-chip accelerators are widely used to improve the energy efficiency of
computing systems. Recent advances in memory technology have enabled near-data …
computing systems. Recent advances in memory technology have enabled near-data …
iPIM: Programmable in-memory image processing accelerator using near-bank architecture
Image processing is becoming an increasingly important domain for many applications on
workstations and the datacenter that require accelerators for high performance and energy …
workstations and the datacenter that require accelerators for high performance and energy …
Euphrates: Algorithm-soc co-design for low-power mobile continuous vision
Continuous computer vision (CV) tasks increasingly rely on convolutional neural networks
(CNN). However, CNNs have massive compute demands that far exceed the performance …
(CNN). However, CNNs have massive compute demands that far exceed the performance …
Mesorasi: Architecture support for point cloud analytics via delayed-aggregation
Point cloud analytics is poised to become a key workload on battery-powered embedded
and mobile platforms in a wide range of emerging application domains, such as …
and mobile platforms in a wide range of emerging application domains, such as …
Diffy: A Déjà vu-free differential deep neural network accelerator
We show that Deep Convolutional Neural Network (CNN) implementations of computational
imaging tasks exhibit spatially correlated values. We exploit this correlation to reduce the …
imaging tasks exhibit spatially correlated values. We exploit this correlation to reduce the …
ecnn: A block-based and highly-parallel cnn accelerator for edge inference
Convolutional neural networks (CNNs) have recently demonstrated superior quality for
computational imaging applications. Therefore, they have great potential to revolutionize the …
computational imaging applications. Therefore, they have great potential to revolutionize the …
Memristor-based light-weight transformer circuit implementation for speech recognizing
Transformer network (TN) is a promising model widely used for natural language processing
(NLP), computer vision (CV), and audio processing (AP). However, the large number of …
(NLP), computer vision (CV), and audio processing (AP). However, the large number of …
Practical Mechanisms for Reducing Processor–Memory Data Movement in Modern Workloads
A Boroumand - 2020 - search.proquest.com
Data movement between the memory system and computation units is one of the most
critical challenges in designing high performance and energy-efficient computing systems …
critical challenges in designing high performance and energy-efficient computing systems …
Analog image denoising with an adaptive memristive crossbar network
Noise in image sensors led to the development of a whole range of denoising filters. A noisy
image can become hard to recognize and often require several types of post-processing …
image can become hard to recognize and often require several types of post-processing …
ImaGen: A general framework for generating memory-and power-efficient image processing accelerators
Image processing algorithms are prime targets for hardware acceleration as they are
commonly used in resource-and power-limited applications. Today's image processing …
commonly used in resource-and power-limited applications. Today's image processing …