Pond: Cxl-based memory pooling systems for cloud platforms
Public cloud providers seek to meet stringent performance requirements and low hardware
cost. A key driver of performance and cost is main memory. Memory pooling promises to …
cost. A key driver of performance and cost is main memory. Memory pooling promises to …
Designing cloud servers for lower carbon
To mitigate climate change, we must reduce carbon emissions from hyperscale cloud
computing. We find that cloud compute servers cause the majority of emissions in a general …
computing. We find that cloud compute servers cause the majority of emissions in a general …
Memory-harvesting vms in cloud platforms
loud platforms monetize their spare capacity by renting “Spot” virtual machines (VMs) that
can be evicted in favor of higher-priority VMs. Recent work has shown that resource …
can be evicted in favor of higher-priority VMs. Recent work has shown that resource …
Syrup: User-defined scheduling across the stack
Suboptimal scheduling decisions in operating systems, networking stacks, and application
runtimes are often responsible for poor application performance, including higher latency …
runtimes are often responsible for poor application performance, including higher latency …
Servermore: Opportunistic execution of serverless functions in the cloud
Serverless computing allows customers to submit their jobs to the cloud for execution, with
the resource provisioning being taken care of by the cloud provider. Serverless functions are …
the resource provisioning being taken care of by the cloud provider. Serverless functions are …
SOL: Safe on-node learning in cloud platforms
Cloud platforms run many software agents on each server node. These agents manage all
aspects of node operation, and in some cases frequently collect data and make decisions …
aspects of node operation, and in some cases frequently collect data and make decisions …
{BlockFlex}: Enabling Storage Harvesting with {Software-Defined} Flash in Modern Cloud Platforms
Cloud platforms today make efficient use of storage resources by slicing them among multi-
tenant applications on demand. However, our study discloses that the cloud storage is still …
tenant applications on demand. However, our study discloses that the cloud storage is still …
When ML Training Cuts Through Congestion: Just-in-Time Gradient Compression via Packet Trimming
Distributed training of ML models generates significant network traffic when exchanging
gradients and is sensitive to packet drops and retransmission caused by congestion when …
gradients and is sensitive to packet drops and retransmission caused by congestion when …
{SpotProxy}: Rediscovering the Cloud for Censorship Circumvention
Censorship circumvention is often fueled by supporters out of goodwill. However, hosting
circumvention proxies can be costly, especially when they are placed in the cloud. We argue …
circumvention proxies can be costly, especially when they are placed in the cloud. We argue …
Snape: Reliable and low-cost computing with mixture of spot and on-demand vms
Cloud providers often have resources that are not being fully utilized, and they may offer
them at a lower cost to make up for the reduced availability of these resources. However …
them at a lower cost to make up for the reduced availability of these resources. However …