Прати
Daliang Xu
Daliang Xu
Верификована је имејл адреса на pku.edu.cn - Почетна страница
Наслов
Навело
Навело
Година
A survey of resource-efficient llm and multimodal foundation models
M Xu, W Yin, D Cai, R Yi, D Xu, Q Wang, B Wu, Y Zhao, C Yang, S Wang, ...
arXiv preprint arXiv:2401.08092, 2024
992024
Mandheling: Mixed-precision on-device dnn training with dsp offloading
D Xu, M Xu, Q Wang, S Wang, Y Ma, K Huang, G Huang, X Jin, X Liu
Proceedings of the 28th Annual International Conference on Mobile Computing …, 2022
532022
Llmcad: Fast and scalable on-device large language model inference
D Xu, W Yin, X Jin, Y Zhang, S Wei, M Xu, X Liu
arXiv preprint arXiv:2309.04255, 2023
502023
Empowering 1000 tokens/second on-device llm prefilling with mllm-npu
D Xu, H Zhang, L Yang, R Liu, G Huang, M Xu, X Liu
arXiv preprint arXiv:2407.05858, 2024
152024
Elms: Elasticized large language models on mobile devices
W Yin, R Yi, D Xu, G Huang, M Xu, X Liu
arXiv preprint arXiv:2409.09071, 2024
52024
Wip: Efficient llm prefilling with mobile npu
D Xu, H Zhang, L Yang, R Liu, M Xu, X Liu
Proceedings of the Workshop on Edge and Mobile Foundation Models, 33-35, 2024
42024
Towards energy-efficient federated learning via int8-based training on mobile dsps
J Yuan, S Wang, H Li, D Xu, Y Li, M Xu, X Liu
Proceedings of the ACM Web Conference 2024, 2786-2794, 2024
42024
Socflow: Efficient and scalable dnn training on soc-clustered edge servers
D Xu, M Xu, C Lou, L Zhang, G Huang, X Jin, X Liu
Proceedings of the 29th ACM International Conference on Architectural …, 2024
32024
Fast On-device LLM Inference with NPUs
D Xu, H Zhang, L Yang, R Liu, G Huang, M Xu, X Liu
Proceedings of the 30th ACM International Conference on Architectural …, 2025
12025
EdgeLLM: Fast On-device LLM Inference with Speculative Decoding
D Xu, W Yin, H Zhang, X Jin, Y Zhang, S Wei, M Xu, X Liu
IEEE Transactions on Mobile Computing, 2024
12024
Efficient, Scalable, and Sustainable DNN Training on SoC-Clustered Edge Servers
M Xu, D Xu, C Lou, L Zhang, G Huang, X Jin, X Liu
IEEE Transactions on Mobile Computing, 2024
12024
Niagara: Scheduling DNN Inference Services on Heterogeneous Edge Processors
D Xu, Q Li, M Xu, K Huang, G Huang, S Wang, X Jin, Y Ma, X Liu
International Conference on Service-Oriented Computing, 67-85, 2023
12023
PieBridge: Fast and Parameter-Efficient On-Device Training via Proxy Networks
W Yin, D Xu, G Huang, Y Zhang, S Wei, M Xu, X Liu
Proceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems …, 2024
2024
Peking University, Beijing 1000871, China {liqingpostdoc, xudaliang}@ pku. edu. cn
Q Li, D Xu
Service-Oriented Computing–ICSOC 2023 Workshops: AI-PA, ASOCA, SAPD, SQS …, 2024
2024
Satellite Computing: From Space to Your Screen
Q Li, D Xu
International Conference on Service-Oriented Computing, 343-349, 2023
2023
S3Library: Automatically Eliminating C/C++ Buffer Overflow using Compatible Safer Libraries
K Sun, D Xu, D Chen, X Cheng, D Tong
arXiv preprint arXiv:2004.09062, 2020
2020
DangKiller: Eliminating Dangling Pointers Efficiently via Implicit Identifier
D Xu, D Chen, C Yang, X Cheng, D Tong
arXiv preprint arXiv:2003.00175, 2020
2020
Saturation Memory Access: Mitigating Memory Spatial Errors without Terminating Programs
D Chen, D Xu, D Tong, K Sun, X Guan, C Yang, X Cheng
arXiv preprint arXiv:2002.02831, 2020
2020
Систем тренутно не може да изврши ову радњу. Пробајте поново касније.
Чланци 1–18