Baolin Li
Northeastern University
Verified email at northeastern.edu
Title
Cited by
Year
From Words to Watts: Benchmarking the Energy Costs of Large Language Model Inference
S Samsi, D Zhao, J McDonald, B Li, A Michaleas, M Jones, W Bergeron, ...
2023 IEEE High Performance Extreme Computing Conference (HPEC), 1-9, 2023
Cited by 119, 2023
Great Power, Great Responsibility: Recommendations for Reducing Energy for Training Language Models
J McDonald, B Li, N Frey, D Tiwari, V Gadepally, S Samsi
arXiv preprint arXiv:2205.09646, 2022
Cited by 59, 2022
MISO: exploiting multi-instance GPU capability on multi-tenant GPU clusters
B Li, T Patel, S Samsi, V Gadepally, D Tiwari
Proceedings of the 13th Symposium on Cloud Computing, 173-189, 2022
Cited by 58, 2022
Toward Sustainable HPC: Carbon Footprint Estimation and Environmental Implications of HPC Systems
B Li, R Basu Roy, D Wang, S Samsi, V Gadepally, D Tiwari
Proceedings of the International Conference for High Performance Computing …, 2023
Cited by 41*, 2023
Experimental evaluation of NISQ quantum computers: error measurement, characterization, and implications
T Patel, A Potharaju, B Li, RB Roy, D Tiwari
SC20: International Conference for High Performance Computing, Networking …, 2020
Cited by 41, 2020
AI-Enabling Workloads on Large-Scale GPU-Accelerated System: Characterization, Opportunities, and Implications
B Li, R Arora, S Samsi, T Patel, W Arcand, D Bestor, C Byun, RB Roy, ...
2022 IEEE International Symposium on High-Performance Computer Architecture …, 2022
Cited by 40, 2022
Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service
B Li, S Samsi, V Gadepally, D Tiwari
Proceedings of the International Conference for High Performance Computing …, 2023
Cited by 37*, 2023
UREQA: Leveraging Operation-Aware Error Rates for Effective Quantum Circuit Mapping on NISQ-Era Quantum Computers
T Patel, B Li, RB Roy, D Tiwari
2020 USENIX Annual Technical Conference (USENIX ATC 20), 705-711, 2020
Cited by 27, 2020
The MIT Supercloud Dataset
S Samsi, ML Weiss, D Bestor, B Li, M Jones, A Reuther, D Edelman, ...
2021 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2021
Cited by 24, 2021
Characterizing Multi-Instance GPU for Machine Learning Workloads
B Li, V Gadepally, S Samsi, D Tiwari
2022 IEEE International Parallel and Distributed Processing Symposium …, 2022
Cited by 22, 2022
RIBBON: cost-effective and QoS-aware deep learning model inference using a diverse pool of cloud computing instances
B Li, RB Roy, T Patel, V Gadepally, K Gettings, D Tiwari
Proceedings of the International Conference for High Performance Computing …, 2021
Cited by 22, 2021
Sprout: Green Generative AI with Carbon-Efficient LLM Inference
B Li, Y Jiang, V Gadepally, D Tiwari
Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024
Cited by 18*, 2024
Sustainable Supercomputing for AI: GPU Power Capping at HPC Scale
D Zhao, S Samsi, J McDonald, B Li, D Bestor, M Jones, D Tiwari, ...
Proceedings of the 2023 ACM Symposium on Cloud Computing, 588-596, 2023
Cited by 16, 2023
LLM Inference Serving: Survey of Recent Advances and Opportunities
B Li, Y Jiang, V Gadepally, D Tiwari
arXiv preprint arXiv:2407.12391, 2024
Cited by 14, 2024
Kairos: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources
B Li, S Samsi, V Gadepally, D Tiwari
Proceedings of the 32nd International Symposium on High-Performance Parallel …, 2023
Cited by 13*, 2023
Benchmarking resource usage for efficient distributed deep learning
NC Frey, B Li, J McDonald, D Zhao, M Jones, D Bestor, D Tiwari, ...
2022 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2022
Cited by 12, 2022
EcoLife: Carbon-Aware Serverless Function Scheduling for Sustainable Computing
Y Jiang, RB Roy, B Li, D Tiwari
SC24: International Conference for High Performance Computing, Networking …, 2024
Cited by 4, 2024
Serving Machine Learning Inference Using Heterogeneous Hardware
B Li, V Gadepally, S Samsi, M Veillette, D Tiwari
2021 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2021
Cited by 4, 2021
The MIT Supercloud Workload Classification Challenge
BJ Tang, Q Chen, ML Weiss, NC Frey, J McDonald, D Bestor, C Yee, ...
2022 IEEE International Parallel and Distributed Processing Symposium …, 2022
Cited by 3, 2022
Interventions to Reduce AI Energy Requirements
D Edelman, J McDonald, D Bestor, M Jones, B Li, D Tiwari, D Zhao, ...
HPCA NetZero, 2023
Cited by 2, 2023
Articles 1–20