Designing cloud servers for lower carbon
To mitigate climate change, we must reduce carbon emissions from hyperscale cloud
computing. We find that cloud compute servers cause the majority of emissions in a general …
computing. We find that cloud compute servers cause the majority of emissions in a general …
BlitzCoin: Fully Decentralized hardware power management for accelerator-rich SoCs
On-chip power-management techniques have evolved over several processor generations.
However, response time and scalability constraints have made it difficult to translate existing …
However, response time and scalability constraints have made it difficult to translate existing …
Agile C-states: a core C-state architecture for latency critical applications optimizing both transition and cold-start latency
Latency-critical applications running in modern datacenters exhibit irregular request arrival
patterns and are implemented using multiple services with strict latency requirements (30 …
patterns and are implemented using multiple services with strict latency requirements (30 …
Leveraging Core and Uncore Frequency Scaling for Power-Efficient Serverless Workflows
Serverless workflows have emerged in FaaS platforms to represent the operational structure
of traditional applications. With latency propagation effects becoming increasingly …
of traditional applications. With latency propagation effects becoming increasingly …
Improving utilization of dataflow unit for multi-batch processing
Dataflow architectures can achieve much better performance and higher efficiency than
general-purpose core, approaching the performance of a specialized design while retaining …
general-purpose core, approaching the performance of a specialized design while retaining …
Mosaic: Harnessing the Micro-Architectural Resources of Servers in Serverless Environments
With serverless computing, users develop scalable applications using lightweight functions
as building blocks, while cloud providers own most of the computing stack, allowing for …
as building blocks, while cloud providers own most of the computing stack, allowing for …
Satisfying Energy-Efficiency Constraints for Mobile Systems
Energy-efficiency is one of the most important design criteria for mobile systems, such as
smartphones and tablets. But current mobile systems always over-provision resources to …
smartphones and tablets. But current mobile systems always over-provision resources to …
Agilepkgc: An agile system idle state architecture for energy proportional datacenter servers
Modern user-facing applications deployed in datacenters use a distributed system
architecture that exacerbates the latency requirements of their constituent microservices (30 …
architecture that exacerbates the latency requirements of their constituent microservices (30 …
HORSE: Ultra-low latency workloads on FaaS platforms
We investigate if FaaS platforms can handle ultra-low latency workloads that run as low as
less than 1 μs and show that even for a warm start, the initialization time takes up to 99, 99 …
less than 1 μs and show that even for a warm start, the initialization time takes up to 99, 99 …
Aging-aware CPU Core Management for Embodied Carbon Amortization in Cloud LLM Inference
Broad adoption of Large Language Models (LLM) demands rapid expansions of cloud LLM
inference clusters, leading to accumulation of embodied carbon $-$ the emissions from …
inference clusters, leading to accumulation of embodied carbon $-$ the emissions from …