A survey on scheduling techniques in computing and network convergence

S Tang, Y Yu, H Wang, G Wang, W Chen… - … Surveys & Tutorials, 2023 - ieeexplore.ieee.org
The computing demand for massive applications has led to the ubiquitous deployment of
computing power. This trend results in the urgent need for higher-level computing resource …

Llmcarbon: Modeling the end-to-end carbon footprint of large language models

A Faiz, S Kaneda, R Wang, R Osi, P Sharma… - arxiv preprint arxiv …, 2023 - arxiv.org
The carbon footprint associated with large language models (LLMs) is a significant concern,
encompassing emissions from their training, inference, experimentation, and storage …

[HTML][HTML] A review on the decarbonization of high-performance computing centers

CA Silva, R Vilaça, A Pereira, RJ Bessa - Renewable and Sustainable …, 2024 - Elsevier
High-performance computing relies on performance-oriented infrastructures with access to
powerful computing resources to complete tasks that contribute to solve complex problems …

Designing cloud servers for lower carbon

J Wang, DS Berger, F Kazhamiaka… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
To mitigate climate change, we must reduce carbon emissions from hyperscale cloud
computing. We find that cloud compute servers cause the majority of emissions in a general …

Characterization of large language model development in the datacenter

Q Hu, Z Ye, Z Wang, G Wang, M Zhang… - … USENIX Symposium on …, 2024 - usenix.org
Large Language Models (LLMs) have presented impressive performance across several
transformative tasks. However, it is non-trivial to efficiently utilize large-scale cluster …

Reducing the Carbon Impact of Generative AI Inference (today and in 2035)

AA Chien, L Lin, H Nguyen, V Rao, T Sharma… - Proceedings of the 2nd …, 2023 - dl.acm.org
Generative AI, exemplified in ChatGPT, Dall-E 2, and Stable Diffusion, are exciting new
applications consuming growing quantities of computing. We study the compute, energy …

Carbonscaler: Leveraging cloud workload elasticity for optimizing carbon-efficiency

WA Hanafy, Q Liang, N Bashir, D Irwin… - Proceedings of the ACM …, 2023 - dl.acm.org
Cloud platforms are increasing their emphasis on sustainability and reducing their
operational carbon footprint. A common approach for reducing carbon emissions is to exploit …

Toward sustainable hpc: Carbon footprint estimation and environmental implications of hpc systems

B Li, R Basu Roy, D Wang, S Samsi… - Proceedings of the …, 2023 - dl.acm.org
The rapid growth in demand for HPC systems has led to a rise in carbon footprint, which
requires urgent intervention. In this work, we present a comprehensive analysis of the …

{FairyWREN}: A Sustainable Cache for Emerging {Write-Read-Erase} Flash Interfaces

S McAllister, B Berg, DS Berger… - … USENIX Symposium on …, 2024 - usenix.org
Datacenters need to reduce embodied carbon emissions, particularly for flash, which
accounts for 40% of embodied carbon in servers. However, decreasing flash's embodied …

Clover: Toward sustainable ai with carbon-aware machine learning inference service

B Li, S Samsi, V Gadepally, D Tiwari - Proceedings of the International …, 2023 - dl.acm.org
This paper presents a solution to the challenge of mitigating carbon emissions from hosting
large-scale machine learning (ML) inference services. ML inference is critical to modern …