From Words to Watts: Benchmarking the Energy Costs of Large Language Model Inference S Samsi, D Zhao, J McDonald, B Li, A Michaleas, M Jones, W Bergeron, ... 2023 IEEE High Performance Extreme Computing Conference (HPEC), 1-9, 2023 | 119 | 2023 |
Great Power, Great Responsibility: Recommendations for Reducing Energy for Training Language Models J McDonald, B Li, N Frey, D Tiwari, V Gadepally, S Samsi arXiv preprint arXiv:2205.09646, 2022 | 59 | 2022 |
MISO: exploiting multi-instance GPU capability on multi-tenant GPU clusters B Li, T Patel, S Samsi, V Gadepally, D Tiwari Proceedings of the 13th Symposium on Cloud Computing, 173-189, 2022 | 58 | 2022 |
Toward Sustainable HPC: Carbon Footprint Estimation and Environmental Implications of HPC Systems B Li, R Basu Roy, D Wang, S Samsi, V Gadepally, D Tiwari Proceedings of the International Conference for High Performance Computing …, 2023 | 41* | 2023 |
Experimental evaluation of nisq quantum computers: error measurement, characterization, and implications T Patel, A Potharaju, B Li, RB Roy, D Tiwari SC20: International Conference for High Performance Computing, Networking …, 2020 | 41 | 2020 |
AI-Enabling Workloads on Large-Scale GPU-Accelerated System: Characterization, Opportunities, and Implications B Li, R Arora, S Samsi, T Patel, W Arcand, D Bestor, C Byun, RB Roy, ... 2022 IEEE International Symposium on High-Performance Computer Architecture …, 2022 | 40 | 2022 |
Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service B Li, S Samsi, V Gadepally, D Tiwari Proceedings of the International Conference for High Performance Computing …, 2023 | 37* | 2023 |
{UREQA}: Leveraging {Operation-Aware} Error Rates for Effective Quantum Circuit Mapping on {NISQ-Era} Quantum Computers T Patel, B Li, RB Roy, D Tiwari 2020 USENIX Annual Technical Conference (USENIX ATC 20), 705-711, 2020 | 27 | 2020 |
The mit supercloud dataset S Samsi, ML Weiss, D Bestor, B Li, M Jones, A Reuther, D Edelman, ... 2021 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2021 | 24 | 2021 |
Characterizing Multi-Instance GPU for Machine Learning Workloads B Li, V Gadepally, S Samsi, D Tiwari 2022 IEEE International Parallel and Distributed Processing Symposium …, 2022 | 22 | 2022 |
RIBBON: cost-effective and qos-aware deep learning model inference using a diverse pool of cloud computing instances B Li, RB Roy, T Patel, V Gadepally, K Gettings, D Tiwari Proceedings of the International Conference for High Performance Computing …, 2021 | 22 | 2021 |
Sprout: Green Generative AI with Carbon-Efficient LLM Inference B Li, Y Jiang, V Gadepally, D Tiwari Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024 | 18* | 2024 |
Sustainable Supercomputing for AI: GPU Power Capping at HPC Scale D Zhao, S Samsi, J McDonald, B Li, D Bestor, M Jones, D Tiwari, ... Proceedings of the 2023 ACM Symposium on Cloud Computing, 588-596, 2023 | 16 | 2023 |
LLM Inference Serving: Survey of Recent Advances and Opportunities B Li, Y Jiang, V Gadepally, D Tiwari arXiv preprint arXiv:2407.12391, 2024 | 14 | 2024 |
Kairos: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources B Li, S Samsi, V Gadepally, D Tiwari Proceedings of the 32nd International Symposium on High-Performance Parallel …, 2023 | 13* | 2023 |
Benchmarking resource usage for efficient distributed deep learning NC Frey, B Li, J McDonald, D Zhao, M Jones, D Bestor, D Tiwari, ... 2022 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2022 | 12 | 2022 |
EcoLife: Carbon-Aware Serverless Function Scheduling for Sustainable Computing Y Jiang, RB Roy, B Li, D Tiwari SC24: International Conference for High Performance Computing, Networking …, 2024 | 4 | 2024 |
Serving Machine Learning Inference Using Heterogeneous Hardware B Li, V Gadepally, S Samsi, M Veillette, D Tiwari 2021 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2021 | 4 | 2021 |
The mit supercloud workload classification challenge BJ Tang, Q Chen, ML Weiss, NC Frey, J McDonald, D Bestor, C Yee, ... 2022 IEEE International Parallel and Distributed Processing Symposium …, 2022 | 3 | 2022 |
Interventions to Reduce AI Energy Requirements D Edelman, J McDonald, D Bestor, M Jones, B Li, D Tiwari, D Zhao, ... HPCA NetZero, 2023 | 2 | 2023 |