Xiang Yang
Title | Cited by | Year
DeepCC: Multi-agent deep reinforcement learning congestion control for multi-path TCP based on self-attention
B He, J Wang, Q Qi, H Sun, J Liao, C Du, X Yang, Z Han
IEEE Transactions on Network and Service Management 18 (4), 4770-4788, 2021
Cited by: 44; Year: 2021
Poster: PipeLLM: Pipeline LLM inference on heterogeneous devices with sequence slicing
R Ma, J Wang, Q Qi, X Yang, H Sun, Z Zhuang, J Liao
Proceedings of the ACM SIGCOMM 2023 Conference, 1126-1128, 2023
Cited by: 18; Year: 2023
Towards efficient inference: Adaptively cooperate in heterogeneous IoT edge cluster
X Yang, Q Qi, J Wang, S Guo, J Liao
2021 IEEE 41st International Conference on Distributed Computing Systems …, 2021
Cited by: 13; Year: 2021
Pico: Pipeline inference framework for versatile CNNs on diverse mobile devices
X Yang, Z Xu, Q Qi, J Wang, H Sun, J Liao, S Guo
IEEE Transactions on Mobile Computing 23 (4), 2712-2730, 2023
Cited by: 11; Year: 2023
Following the correct direction: Renovating sparsified SGD towards global optimization in distributed edge learning
W Ning, H Sun, X Fu, X Yang, Q Qi, J Wang, J Liao, Z Han
IEEE Journal on Selected Areas in Communications 40 (2), 499-514, 2021
Cited by: 6; Year: 2021
Adaptive DNN surgery for selfish inference acceleration with on-demand edge resource
X Yang, D Chen, Q Qi, J Wang, H Sun, J Liao, S Guo
arXiv preprint arXiv:2306.12185, 2023
Cited by: 4; Year: 2023
HPipe: Large Language Model Pipeline Parallelism for Long Context on Heterogeneous Cost-effective Devices
R Ma, X Yang, J Wang, Q Qi, H Sun, J Wang, Z Zhuang, J Liao
Proceedings of the 2024 Conference of the North American Chapter of the …, 2024
Cited by: 3; Year: 2024
Shuffle-exchange brings faster: Reduce the idle time during communication for decentralized neural network training
X Yang
arXiv preprint arXiv:2007.00433, 2020
Cited by: 3; Year: 2020
Grouping synchronous to eliminate stragglers with edge computing in distributed deep learning
Z Gui, X Yang, H Yang, W Li, L Zhang, Q Qi, J Wang, H Sun, J Liao
2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications …, 2021
Cited by: 2; Year: 2021
Brief Announcement: Accelerate CNN Inference with Zoning Graph at Dynamic Granularity
R Ma, X Yang, Q Qi, J Wang, Z Zhuang, J Wang, X Wang
Proceedings of the 35th ACM Symposium on Parallelism in Algorithms and …, 2023
Cited by: 1; Year: 2023
DeepZoning: Re-accelerate CNN Inference with Zoning Graph for Heterogeneous Edge Cluster
J Wang, R Ma, X Yang, Q Qi, Z Zhuang, J Wang, J Liao, S Guo
ACM Transactions on Architecture and Code Optimization, 2024
Year: 2024
Articles 1–11