Weiming Hu
Verified email at sjtu.edu.cn
Title
Cited by
Year
OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization
C Guo, J Tang, W Hu, J Leng, C Zhang, F Yang, Y Liu, M Guo, Y Zhu
Proceedings of the 50th Annual International Symposium on Computer …, 2023
Cited by: 89, Year: 2023
vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving
J Xu, R Zhang, C Guo, W Hu, Z Liu, F Wu, Y Feng, S Sun, C Shao, Y Guo, ...
arXiv preprint arXiv:2407.15309, 2024
Cited by: 3, Year: 2024
Cache-locality Based Adaptive Warp Scheduling for Neural Network Acceleration on GPGPUs
W Hu, Y Zhou, Y Quan, Y Wang, X Lou
2022 IEEE 35th International System-on-Chip Conference (SOCC), 1-6, 2022
Year: 2022