DeepRecSys: A System for Optimizing End-To-End At-scale Neural Recommendation Inference U Gupta, S Hsia, V Saraph, X Wang, B Reagen, GY Wei, HHS Lee, ... 2020 47th ACM/IEEE Annual International Symposium on Computer Architecture …, 2020 | 213 | 2020 |
RecSSD: near data processing for solid state drive based recommendation inference M Wilkening, U Gupta, S Hsia, C Trippel, CJ Wu, D Brooks, GY Wei 2021 26th ACM International Conference on Architectural Support for …, 2021 | 116 | 2021 |
RecPipe: Co-designing Models and Hardware to Jointly Optimize Recommendation Quality and Performance U Gupta, S Hsia, M Wilkening, J Pombra, HHS Lee, GY Wei, CJ Wu, ... 2021 54th IEEE/ACM International Symposium on Microarchitecture (MICRO), 2021 | 44 | 2021 |
Cross-Stack Workload Characterization of Deep Recommendation Systems S Hsia, U Gupta, M Wilkening, CJ Wu, GY Wei, D Brooks 2020 IEEE International Symposium on Workload Characterization (IISWC), 2020 | 34 | 2020 |
MP-Rec: Hardware-Software Co-Design to Enable Multi-Path Recommendation S Hsia, U Gupta, B Acun, N Ardalani, P Zhong, GY Wei, D Brooks, CJ Wu 2023 28th ACM International Conference on Architectural Support for …, 2023 | 17 | 2023 |
Generative AI beyond LLMs: system implications of multi-modal generation A Golden, S Hsia, F Sun, B Acun, B Hosmer, Y Lee, Z DeVito, J Johnson, ... 2024 IEEE International Symposium on Performance Analysis of Systems and …, 2024 | 10 | 2024 |
Characterizing the Scalability of Graph Convolutional Networks on Intel® PIUMA MJ Adiletta, JJ Tithi, EI Farsarakis, G Gerogiannis, R Adolf, R Benke, ... 2023 IEEE International Symposium on Performance Analysis of Systems and …, 2023 | 9 | 2023 |
Is Flash Attention Stable? A Golden, S Hsia, F Sun, B Acun, B Hosmer, Y Lee, Z DeVito, J Johnson, ... 2024 9th Energy Efficient Machine Learning and Cognitive Computing Workshop …, 2024 | 6 | 2024 |
MAD Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems S Hsia, A Golden, B Acun, N Ardalani, Z DeVito, GY Wei, D Brooks, CJ Wu 2024 51st ACM/IEEE Annual International Symposium on Computer Architecture …, 2024 | 4 | 2024 |
Architecting Efficient, Large-Scale AI: An Algorithm-System Co-Design Approach SCY Hsia Harvard University, 2024 | | 2024 |
Cross-Stack Characterization and Solid State Drive-Based Near Data Processing for Recommendation Workloads S Hsia, M Wilkening, U Gupta, C Trippel, CJ Wu, D Brooks, ... 2021 Boston Area Architecture Workshop (BARC), 2021 | | 2021 |