Dynamic sampling and selective masking for communication-efficient federated learning S Ji, W Jiang, A Walid, X Li arXiv:2003.09603, 2020, 2021 | 90 | 2021 |
MicroRec: efficient recommendation inference by hardware and data structure solutions W Jiang, Z He, S Zhang, TB Preußer, K Zeng, L Feng, J Zhang, T Liu, Y Li, ... MLSys'21: Proceedings of Machine Learning and Systems 3 (MLSys 2021), 2021 | 54* | 2021 |
Fleetrec: Large-scale recommendation inference on hybrid gpu-fpga clusters W Jiang, Z He, S Zhang, K Zeng, L Feng, J Zhang, T Liu, Y Li, J Zhou, ... KDD'21: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery …, 2021 | 49 | 2021 |
Distributed recommendation inference on fpga clusters Y Zhu, Z He, W Jiang, K Zeng, J Zhou, G Alonso 2021 31st International Conference on Field-Programmable Logic and …, 2021 | 25 | 2021 |
PipeRAG: Fast retrieval-augmented generation via adaptive pipeline parallelism W Jiang, S Zhang, B Han, J Wang, YB Wang, T Kraska KDD'25: Proceedings of the 31th ACM SIGKDD Conference on Knowledge Discovery …, 2025 | 22* | 2025 |
Co-design Hardware and Algorithm for Vector Search W Jiang, S Li, Y Zhu, JF Licht, Z He, R Shi, C Renggli, S Zhang, ... SC'23: The International Conference for High Performance Computing …, 2023 | 18 | 2023 |
Chameleon: a heterogeneous and disaggregated accelerator system for retrieval-augmented language models W Jiang, M Zeller, R Waleffe, T Hoefler, G Alonso VLDB'25: Proceedings of the VLDB Endowment Volume 18, 2025 | 12* | 2025 |
Data-Informed Geometric Space Selection S Zhang, W Jiang NeurIPS'23: Thirty-seventh Conference on Neural Information Processing Systems, 2023 | 12* | 2023 |
Data Processing with FPGAs on Modern Architectures W Jiang, D Korolija, G Alonso SIGMOD'23 Tutorial, 77-82, 2023 | 12 | 2023 |
MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels Q Chen, X Geng, C Rosset, C Buractaon, J Lu, T Shen, K Zhou, C Xiong, ... WWW'24: Companion Proceedings of the ACM on Web Conference 2024, 292-301, 2024 | 4 | 2024 |
Swiftspatial: Spatial joins on modern hardware W Jiang, M Parvanov, G Alonso arXiv preprint arXiv:2309.16520, 2023 | 4 | 2023 |
Accelerating Graph-based Vector Search via Delayed-Synchronization Traversal W Jiang, H Hu, T Hoefler, G Alonso arXiv preprint arXiv:2406.12385, 2024 | 1 | 2024 |
Multi-Tenant SmartNICs for In-Network Preprocessing of Recommender Systems Y Zhu, W Jiang, G Alonso arXiv preprint arXiv:2501.12032, 2025 | | 2025 |
Efficient Tabular Data Preprocessing of ML Pipelines Y Zhu, W Jiang, G Alonso arXiv preprint arXiv:2409.14912, 2024 | | 2024 |