Zhekai Zhang
Verified email at mit.edu
Title
Cited by
Year
Once-for-all: Train one network and specialize it for efficient deployment
H Cai, C Gan, T Wang, Z Zhang, S Han
arXiv preprint arXiv:1908.09791, 2019
Cited by: 1536; Year: 2019
SpAtten: Efficient sparse attention architecture with cascade token and head pruning
H Wang, Z Zhang, S Han
2021 IEEE International Symposium on High-Performance Computer Architecture …, 2021
Cited by: 404; Year: 2021
SpArch: Efficient architecture for sparse matrix multiplication
Z Zhang, H Wang, S Han, WJ Dally
2020 IEEE International Symposium on High Performance Computer Architecture …, 2020
Cited by: 284; Year: 2020
PointAcc: Efficient point cloud accelerator
Y Lin, Z Zhang, H Tang, H Wang, S Han
MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021
Cited by: 80; Year: 2021
QServe: W4A8KV4 quantization and system co-design for efficient LLM serving
Y Lin, H Tang, S Yang, Z Zhang, G Xiao, C Gan, S Han
arXiv preprint arXiv:2405.04532, 2024
Cited by: 46; Year: 2024
Lightening-Transformer: A dynamically-operated optically-interconnected photonic transformer accelerator
H Zhu, J Gu, H Wang, Z Jiang, Z Zhang, R Tang, C Feng, S Han, RT Chen, ...
2024 IEEE International Symposium on High-Performance Computer Architecture …, 2024
Cited by: 16; Year: 2024
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
E Xie, J Chen, J Chen, H Cai, H Tang, Y Lin, Z Zhang, M Li, L Zhu, Y Lu, ...
arXiv preprint arXiv:2410.10629, 2024
Cited by: 11; Year: 2024
SVDQuant: Absorbing outliers by low-rank components for 4-bit diffusion models
M Li, Y Lin, Z Zhang, T Cai, X Li, J Guo, E Xie, C Meng, JY Zhu, S Han
arXiv preprint arXiv:2411.05007, 2024
Cited by: 10; Year: 2024
VideoTime3: A 40-uJ/frame 38 FPS Video Understanding Accelerator With Real-Time DiffFrame Temporal Redundancy Reduction and Temporal Modeling
M Wang, Y Lin, Z Zhang, J Lin, S Han, AP Chandrakasan
IEEE Solid-State Circuits Letters 6, 169-172, 2023
Cited by: 2; Year: 2023