Ke Hong

Посилання

	Усі	З 2020
Цитування	247	247
h-індекс	8	8
i10-індекс	8	8

200

100

150

202120222023202420251 3 16 197 29

Доступні для всіх

Переглянути всі

5 статей

1 стаття

доступні

недоступні

За умовами фінансування

Підписатись

Ke Hong

Tsinghua University

Підтверджена електронна адреса в mails.tsinghua.edu.cn

efficient computing GPU acceleration sparse computing ML system


Назва Сортувати за цитуваннями Сортувати за роком Сортувати за назвою	Посилання Посилання	Рік
A survey on efficient inference for large language models Z Zhou, X Ning, K Hong, T Fu, J Xu, S Li, Y Lou, L Wang, Z Yuan, X Li, ... arXiv preprint arXiv:2404.14294, 2024	77	2024
Flashdecoding++: Faster large language model inference with asynchronization, flat gemm optimization, and heuristics K Hong, G Dai, J Xu, Q Mao, X Li, J Liu, Y Dong, Y Wang Proceedings of Machine Learning and Systems 6, 148-161, 2024	57*	2024
Torchsparse++: Efficient training and inference framework for sparse convolution on gpus H Tang, S Yang, Z Liu, K Hong, Z Yu, X Li, G Dai, Y Wang, S Han Proceedings of the 56th Annual IEEE/ACM International Symposium on …, 2023	37*	2023
A learning-based AOA estimation method for device-free localization K Hong, T Wang, J Liu, Y Wang, Y Shen IEEE Communications Letters 26 (6), 1264-1267, 2022	21	2022
Llm-mq: Mixed-precision quantization for efficient llm deployment S Li, X Ning, K Hong, T Liu, L Wang, X Li, K Zhong, G Dai, H Yang, ... NeurIPS 2023 Efficient Natural Language and Speech Processing Workshop, 1-5, 2023	16	2023
Ada3d: Exploiting the spatial redundancy with adaptive inference for efficient 3d object detection T Zhao, X Ning, K Hong, Z Qiu, P Lu, Y Zhao, L Zhang, L Zhou, G Dai, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	13	2023
An efficient accelerator for point-based and voxel-based point cloud neural networks X Yang, T Fu, G Dai, S Zeng, K Zhong, K Hong, Y Wang 2023 60th ACM/IEEE Design Automation Conference (DAC), 1-6, 2023	11	2023
Exploiting hardware utilization and adaptive dataflow for efficient sparse convolution in 3D point clouds K Hong, Z Yu, G Dai, X Yang, Y Lian, N Xu, Y Wang Proceedings of Machine Learning and Systems 5, 428-441, 2023	10	2023
Feasta: A flexible and efficient accelerator for sparse tensor algebra in machine learning K Zhong, Z Zhu, G Dai, H Wang, X Yang, H Zhang, J Si, Q Mao, S Zeng, ... Proceedings of the 29th ACM International Conference on Architectural …, 2024	4	2024
A point transformer accelerator with fine-grained pipelines and distribution-aware dynamic FPS Y Lian, X Yang, K Hong, Y Wang, G Dai, N Xu 2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD), 1-9, 2023	1	2023
MBQ: Modality-Balanced Quantization for Large Vision-Language Models S Li, Y Hu, X Ning, X Liu, K Hong, X Jia, X Li, Y Yan, P Ran, G Dai, S Yan, ... arXiv preprint arXiv:2412.19509, 2024		2024
A Point Transformer Accelerator With Distribution-Aware Heuristic Distance Calculation Y Lian, X Yang, K Hong, Y Wang, N Xu, G Dai IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2024		2024

У даний момент система не може виконати операцію. Спробуйте пізніше.

Статті 1–12

Кількість бібліографічних посилань на рік

Повторювані посилання

Об’єднані посилання

Додати співавторівСпівавтори

Підписатись

Посилання