Volgen
Jinhao Li
Titel
Geciteerd door
Geciteerd door
Jaar
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
J Li, J Xu, S Huang, Y Chen, W Li, J Liu, Y Lian, J Pan, L Ding, H Zhou, ...
arXiv preprint arXiv:2410.04466, 2024
62024
A 4Gbps DPPM on-chip serial link based on pipelined Vernier-Tdc
J Li, C Qu, F Wu, J Jiang
2020 IEEE 15th International Conference on Solid-State & Integrated Circuit …, 2020
32020
Fast and Efficient 2-bit LLM Inference on GPU: 2/4/16-bit in a Weight Matrix with Asynchronous Dequantization
J Li, J Xu, S Li, S Huang, Y Lian, J Liu, G Dai
arXiv preprint arXiv:2311.16442, 2023
2*2023
Marca: Mamba accelerator with reconfigurable architecture
J Li, S Huang, J Xu, J Liu, L Ding, N Xu, G Dai
arXiv preprint arXiv:2409.11440, 2024
12024
A gain reconfigurable time difference amplifier with self-adaptive linearity control
J Li, J Jiang, Q Wang, N Jing, W Sheng, G He
Analog Integrated Circuits and Signal Processing 107, 435-449, 2021
12021
Enabling Efficient Sparse Multiplications on GPUs with Heuristic Adaptability
J Xu, S Huang, J Li, G Huang, Y Xie, Y Wang, G Dai
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2024
2024
SoftmAP: Software-Hardware Co-design for Integer-Only Softmax on Associative Processors
M Rakka, J Li, G Dai, A Eltawil, ME Fouda, F Kurdahi
arXiv preprint arXiv:2411.17847, 2024
2024
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–7