Spatten: Efficient sparse attention architecture with cascade token and head pruning

H Wang, Z Zhang, S Han - 2021 IEEE International Symposium …, 2021 - ieeexplore.ieee.org
The attention mechanism is becoming increasingly popular in Natural Language Processing
(NLP) applications, showing superior performance than convolutional and recurrent …

Instant-3d: Instant neural radiance field training towards on-device ar/vr 3d reconstruction

S Li, C Li, W Zhu, B Yu, Y Zhao, C Wan, H You… - Proceedings of the 50th …, 2023 - dl.acm.org
Neural Radiance Field (NeRF) based 3D reconstruction is highly desirable for immersive
Augmented and Virtual Reality (AR/VR) applications, but achieving instant (ie,< 5 seconds) …

Hw-nas-bench: Hardware-aware neural architecture search benchmark

C Li, Z Yu, Y Fu, Y Zhang, Y Zhao, H You, Q Yu… - arxiv preprint arxiv …, 2021 - arxiv.org
HardWare-aware Neural Architecture Search (HW-NAS) has recently gained tremendous
attention by automating the design of DNNs deployed in more resource-constrained daily …

[HTML][HTML] Recent developments in low-power AI accelerators: A survey

C Åleskog, H Grahn, A Borg - Algorithms, 2022 - mdpi.com
As machine learning and AI continue to rapidly develop, and with the ever-closer end of
Moore's law, new avenues and novel ideas in architecture design are being created and …

Shiftaddnet: A hardware-inspired deep network

H You, X Chen, Y Zhang, C Li, S Li… - Advances in …, 2020 - proceedings.neurips.cc
Multiplication (eg, convolution) is arguably a cornerstone of modern deep neural networks
(DNNs). However, intensive multiplications cause expensive resource costs that challenge …

Gan slimming: All-in-one gan compression by a unified optimization framework

H Wang, S Gui, H Yang, J Liu, Z Wang - European Conference on …, 2020 - Springer
Generative adversarial networks (GANs) have gained increasing popularity in various
computer vision applications, and recently start to be deployed to resource-constrained …

Energy-efficient computing-in-memory architecture for AI processor: device, circuit, architecture perspective

L Chang, C Li, Z Zhang, J **ao, Q Liu, Z Zhu… - Science China …, 2021 - Springer
An artificial intelligence (AI) processor is a promising solution for energy-efficient data
processing, including health monitoring and image/voice recognition. However, data …

" BNN-BN=?": Training Binary Neural Networks Without Batch Normalization

T Chen, Z Zhang, X Ouyang, Z Liu… - Proceedings of the …, 2021 - openaccess.thecvf.com
Batch normalization (BN) is a key facilitator and considered essential for state-of-the-art
binary neural networks (BNN). However, the BN layer is costly to calculate and is typically …

G-CoS: GNN-accelerator co-search towards both better accuracy and efficiency

Y Zhang, H You, Y Fu, T Geng, A Li… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org
Graph Neural Networks (GNNs) have emerged as the state-of-the-art (SOTA) method for
graph-based learning tasks. However, it still remains prohibitively challenging to inference …

When the metaverse meets carbon neutrality: ongoing efforts and directions

F Liu, Q Pei, S Chen, Y Yuan, L Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
The metaverse has recently gained increasing attention from the public. It builds up a virtual
world where we can live as a new role regardless of the role we play in the physical world …