Designing cloud servers for lower carbon

J Wang, DS Berger, F Kazhamiaka… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
To mitigate climate change, we must reduce carbon emissions from hyperscale cloud
computing. We find that cloud compute servers cause the majority of emissions in a general …

BlitzCoin: Fully Decentralized hardware power management for accelerator-rich SoCs

M Cochet, K Swaminathan, E Loscalzo… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
On-chip power-management techniques have evolved over several processor generations.
However, response time and scalability constraints have made it difficult to translate existing …

Agile C-states: a core C-state architecture for latency critical applications optimizing both transition and cold-start latency

G Antoniou, D Bartolini, H Volos, M Kleanthous… - ACM Transactions on …, 2024 - dl.acm.org
Latency-critical applications running in modern datacenters exhibit irregular request arrival
patterns and are implemented using multiple services with strict latency requirements (30 …

Leveraging Core and Uncore Frequency Scaling for Power-Efficient Serverless Workflows

A Tzenetopoulos, D Masouros, S Xydis… - arxiv preprint arxiv …, 2024 - arxiv.org
Serverless workflows have emerged in FaaS platforms to represent the operational structure
of traditional applications. With latency propagation effects becoming increasingly …

Improving utilization of dataflow unit for multi-batch processing

Z Fan, W Li, Z Wang, Y Yang, X Ye, D Fan… - ACM Transactions on …, 2024 - dl.acm.org
Dataflow architectures can achieve much better performance and higher efficiency than
general-purpose core, approaching the performance of a specialized design while retaining …

Mosaic: Harnessing the Micro-Architectural Resources of Servers in Serverless Environments

J Stojkovic, E Choukse, E Saurez… - 2024 57th IEEE/ACM …, 2024 - ieeexplore.ieee.org
With serverless computing, users develop scalable applications using lightweight functions
as building blocks, while cloud providers own most of the computing stack, allowing for …

Satisfying Energy-Efficiency Constraints for Mobile Systems

X Li, S Hong, J Chen, J Ji, C Luo… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Energy-efficiency is one of the most important design criteria for mobile systems, such as
smartphones and tablets. But current mobile systems always over-provision resources to …

Agilepkgc: An agile system idle state architecture for energy proportional datacenter servers

G Antoniou, H Volos, DB Bartolini… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
Modern user-facing applications deployed in datacenters use a distributed system
architecture that exacerbates the latency requirements of their constituent microservices (30 …

HORSE: Ultra-low latency workloads on FaaS platforms

D Mvondo, F Taiani, YD Bromberg - Proceedings of the 25th International …, 2024 - dl.acm.org
We investigate if FaaS platforms can handle ultra-low latency workloads that run as low as
less than 1 μs and show that even for a warm start, the initialization time takes up to 99, 99 …

Aging-aware CPU Core Management for Embodied Carbon Amortization in Cloud LLM Inference

TB Hewage, S Ilager, MR Read, R Buyya - arxiv preprint arxiv …, 2025 - arxiv.org
Broad adoption of Large Language Models (LLM) demands rapid expansions of cloud LLM
inference clusters, leading to accumulation of embodied carbon $-$ the emissions from …