Pond: Cxl-based memory pooling systems for cloud platforms

H Li, DS Berger, L Hsu, D Ernst, P Zardoshti… - Proceedings of the 28th …, 2023 - dl.acm.org
Public cloud providers seek to meet stringent performance requirements and low hardware
cost. A key driver of performance and cost is main memory. Memory pooling promises to …

Designing cloud servers for lower carbon

J Wang, DS Berger, F Kazhamiaka… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
To mitigate climate change, we must reduce carbon emissions from hyperscale cloud
computing. We find that cloud compute servers cause the majority of emissions in a general …

Memory-harvesting vms in cloud platforms

A Fuerst, S Novaković, Í Goiri, GI Chaudhry… - Proceedings of the 27th …, 2022 - dl.acm.org
loud platforms monetize their spare capacity by renting “Spot” virtual machines (VMs) that
can be evicted in favor of higher-priority VMs. Recent work has shown that resource …

Syrup: User-defined scheduling across the stack

K Kaffes, JT Humphries, D Mazières… - Proceedings of the ACM …, 2021 - dl.acm.org
Suboptimal scheduling decisions in operating systems, networking stacks, and application
runtimes are often responsible for poor application performance, including higher latency …

Servermore: Opportunistic execution of serverless functions in the cloud

A Suresh, A Gandhi - Proceedings of the ACM symposium on cloud …, 2021 - dl.acm.org
Serverless computing allows customers to submit their jobs to the cloud for execution, with
the resource provisioning being taken care of by the cloud provider. Serverless functions are …

SOL: Safe on-node learning in cloud platforms

Y Wang, D Crankshaw, NJ Yadwadkar… - Proceedings of the 27th …, 2022 - dl.acm.org
Cloud platforms run many software agents on each server node. These agents manage all
aspects of node operation, and in some cases frequently collect data and make decisions …

{BlockFlex}: Enabling Storage Harvesting with {Software-Defined} Flash in Modern Cloud Platforms

B Reidys, J Sun, A Badam, S Noghabi… - 16th USENIX Symposium …, 2022 - usenix.org
Cloud platforms today make efficient use of storage resources by slicing them among multi-
tenant applications on demand. However, our study discloses that the cloud storage is still …

When ML Training Cuts Through Congestion: Just-in-Time Gradient Compression via Packet Trimming

X Chen, S Vargaftik, RB Basat - Proceedings of the 23rd ACM Workshop …, 2024 - dl.acm.org
Distributed training of ML models generates significant network traffic when exchanging
gradients and is sensitive to packet drops and retransmission caused by congestion when …

{SpotProxy}: Rediscovering the Cloud for Censorship Circumvention

PTJ Kon, S Kamali, J Pei, D Barradas, A Chen… - 33rd USENIX Security …, 2024 - usenix.org
Censorship circumvention is often fueled by supporters out of goodwill. However, hosting
circumvention proxies can be costly, especially when they are placed in the cloud. We argue …

Snape: Reliable and low-cost computing with mixture of spot and on-demand vms

F Yang, L Wang, Z Xu, J Zhang, L Li, B Qiao… - Proceedings of the 28th …, 2023 - dl.acm.org
Cloud providers often have resources that are not being fully utilized, and they may offer
them at a lower cost to make up for the reduced availability of these resources. However …