A full-stack search technique for domain optimized deep learning accelerators

D Zhang, S Huda, E Songhori, K Prabhu, Q Le… - Proceedings of the 27th …, 2022 - dl.acm.org
The rapidly-changing deep learning landscape presents a unique opportunity for building
inference accelerators optimized for specific datacenter-scale workloads. We propose Full …

DOSA: Differentiable model-based one-loop search for DNN accelerators

C Hong, Q Huang, G Dinh, M Subedar… - Proceedings of the 56th …, 2023 - dl.acm.org
In the hardware design space exploration process, it is critical to optimize both hardware
parameters and algorithm-to-hardware mappings. Previous work has largely approached …

Open source Vizier: Distributed infrastructure and API for reliable and flexible blackbox optimization

X Song, S Perel, C Lee, G Kochanski… - International …, 2022 - proceedings.mlr.press
Vizier is the de-facto blackbox optimization service across Google, having optimized some of
Google's largest products and research efforts. To operate at the scale of tuning thousands …

An evaluation of Edge TPU accelerators for convolutional neural networks

K Seshadri, B Akin, J Laudon… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Edge TPUs are a domain of accelerators for low-power, edge devices and are widely used
in various Google products such as Coral and Pixel devices. In this paper, we first discuss …

Learning a continuous and reconstructible latent space for hardware accelerator design

Q Huang, C Hong, J Wawrzynek… - … Analysis of Systems …, 2022 - ieeexplore.ieee.org
The hardware design space is high-dimensional and discrete. Systematic and efficient
exploration of this space has been a significant challenge. Central to this problem is the …

Chimera: A hybrid machine learning-driven multi-objective design space exploration tool for FPGA high-level synthesis

M Yu, S Huang, D Chen - … and Automated Learning–IDEAL 2021: 22nd …, 2021 - Springer
In recent years, hardware accelerators based on field programmable gate arrays (FPGA)
have been widely applied and the high-level synthesis (HLS) tools were created to facilitate …

ACDSE: A design space exploration method for CNN accelerator based on adaptive compression mechanism

K Feng, X Fan, J An, C Li, K Di, J Li - ACM Transactions on Embedded …, 2023 - dl.acm.org
Customized accelerators for Convolutional Neural Network (CNN) can achieve better
energy efficiency than general computing platforms. However, the design of a high …