עקוב אחר
Xusheng Chen
Xusheng Chen
Huawei Cloud
כתובת אימייל מאומתת בדומיין cs.hku.hk - דף הבית
כותרת
צוטט על ידי
צוטט על ידי
שנה
Apus: Fast and scalable paxos on rdma
C Wang, J Jiang, X Chen, N Yi, H Cui
Proceedings of the 2017 Symposium on Cloud Computing, 94-107, 2017
1172017
Bidl: A high-throughput, low-latency permissioned blockchain framework for datacenter networks
J Qi, X Chen, Y Jiang, J Jiang, T Shen, S Zhao, S Wang, G Zhang, L Chen, ...
Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles …, 2021
442021
vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training
S Zhao, F Li, X Chen, X Guan, J Jiang, D Huang, Y Qing, S Wang, P Wang, ...
IEEE Transactions on Parallel and Distributed Systems 33 (3), 489-506, 2021
392021
Inference without interference: Disaggregate llm inference for mixed downstream workloads
C Hu, H Huang, L Xu, X Chen, J Xu, S Chen, H Feng, C Wang, S Wang, ...
arXiv preprint arXiv:2401.11181, 2024
382024
{SOTER}: Guarding black-box inference for general neural networks at the edge
T Shen, J Qi, J Jiang, X Wang, S Wen, X Chen, S Zhao, S Wang, L Chen, ...
2022 USENIX Annual Technical Conference (USENIX ATC 22), 723-738, 2022
352022
Achieving low tail-latency and high scalability for serializable transactions in edge computing
X Chen, H Song, J Jiang, C Ruan, C Li, S Wang, G Zhang, R Cheng, ...
Proceedings of the Sixteenth European Conference on Computer Systems, 210-227, 2021
332021
Uranus: Simple, efficient sgx programming and its applications
J Jiang, X Chen, TO Li, C Wang, T Shen, S Zhao, H Cui, CL Wang, ...
Proceedings of the 15th ACM Asia Conference on Computer and Communications …, 2020
302020
CRONUS: Fault-isolated, secure and high-performance heterogeneous computing for trusted execution environment
J Jiang, J Qi, T Shen, X Chen, S Zhao, S Wang, L Chen, G Zhang, X Luo, ...
2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO), 124-143, 2022
232022
{PLOVER}: Fast, multi-core scalable virtual machine fault-tolerance
C Wang, X Chen, W Jia, B Li, H Qiu, S Zhao, H Cui
15th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2018
202018
Daenet: Making strong anonymity scale in a fully decentralized network
T Shen, J Jiang, Y Jiang, X Chen, J Qi, S Zhao, F Zhang, X Luo, H Cui
IEEE Transactions on Dependable and Secure Computing 19 (4), 2286-2303, 2021
192021
Efficient and DoS-resistant consensus for permissioned blockchains
X Chen, S Zhao, J Qi, J Jiang, H Song, C Wang, T On Li, TH Hubert Chan, ...
ACM SIGMETRICS Performance Evaluation Review 49 (3), 61-62, 2022
18*2022
Effectively Mitigating {I/O} Inactivity in {vCPU} Scheduling
W Jia, C Wang, X Chen, J Shan, X Shang, H Cui, X Ding, L Cheng, ...
2018 USENIX Annual Technical Conference (USENIX ATC 18), 267-280, 2018
182018
Memserve: Context caching for disaggregated llm serving with elastic memory pool
C Hu, H Huang, J Hu, J Xu, X Chen, T Xie, C Wang, S Wang, Y Bao, ...
arXiv preprint arXiv:2406.17565, 2024
162024
Fold3D: Rethinking and Parallelizing Computational and Communicational Tasks in the Training of Large DNN Models
F Li, S Zhao, Y Qing, X Chen, X Guan, S Wang, G Zhang, H Cui
IEEE Transactions on Parallel and Distributed Systems 34 (5), 1432-1449, 2023
132023
A geography-based p2p overlay network for fast and robust blockchain systems
H Qiu, T Ji, S Zhao, X Chen, J Qi, H Cui, S Wang
IEEE Transactions on Services Computing 16 (3), 1572-1588, 2022
112022
Naspipe: high performance and reproducible pipeline parallel supernet training via causal synchronous parallelism
S Zhao, F Li, X Chen, T Shen, L Chen, S Wang, N Zhang, C Li, H Cui
Proceedings of the 27th ACM International Conference on Architectural …, 2022
112022
Caraserve: Cpu-assisted and rank-aware lora serving for generative llm inference
S Li, H Lu, T Wu, M Yu, Q Weng, X Chen, Y Shan, B Yuan, W Wang
arXiv preprint arXiv:2401.11240, 2024
92024
A fast, general storage replication protocol for active-active virtual machine fault tolerance
C Wang, X Chen, Z Wang, Y Zhu, H Cui
2017 IEEE 23rd International Conference on Parallel and Distributed Systems …, 2017
72017
The cap principle for llm serving
P Zeng, Z Ning, J Zhao, W Cui, M Xu, L Guo, X Chen, Y Shan
arXiv e-prints, arXiv: 2405.11299, 2024
52024
Inference without interference: Disaggregate llm inference for mixed downstream workloads, 2024
C Hu, H Huang, L Xu, X Chen, J Xu, S Chen, H Feng, C Wang, S Wang, ...
URL https://arxiv. org/abs/2401.11181, 2024
52024
המערכת אינה יכולה לבצע את הפעולה כעת. נסה שוב מאוחר יותר.
מאמרים 1–20