Instinfer: In-storage attention offloading for cost-effective long-context llm inference X Pan, E Li, Q Li, S Liang, Y Shan, K Zhou, Y Luo, X Wang, J Zhang arXiv preprint arXiv:2409.04992, 2024 | 9 | 2024 |
StreamPIM: Streaming Matrix Computation in Racetrack Memory Y An, Y Tang, S Yi, L Peng, X Pan, G Sun, Z Luo, Q Li, J Zhang 2024 IEEE International Symposium on High-Performance Computer Architecture …, 2024 | 4 | 2024 |
BeaconGNN: Large-Scale GNN Acceleration with Out-of-Order Streaming In-Storage Computing Y Wang, X Pan, Y An, J Zhang, G Reinman 2024 IEEE International Symposium on High-Performance Computer Architecture …, 2024 | 3 | 2024 |
Flagger: Cooperative acceleration for large-scale cross-silo federated learning aggregation X Pan, Y An, S Liang, B Mao, M Zhang, Q Li, M Jung, J Zhang 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture …, 2024 | 2 | 2024 |
{ScalaAFA}: Constructing {User-Space}{All-Flash} Array Engine with Holistic Designs S Yi, X Pan, Q Li, Q Li, C Wang, B Mao, M Jung, J Zhang 2024 USENIX Annual Technical Conference (USENIX ATC 24), 141-156, 2024 | 1 | 2024 |
BcBench: Exploring Throughput Processor Designs based on Blockchain Benchmarking X Pan, Y Chen, S Yi, J Zhang Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing, 88-97, 2023 | 1 | 2023 |
A Survey on User-Space Storage and Its Implementations J Li, X Pan, S Yi, J Zhang arXiv preprint arXiv:2306.10503, 2023 | | 2023 |