Designing cloud servers for lower carbon

J Wang, DS Berger, F Kazhamiaka… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
To mitigate climate change, we must reduce carbon emissions from hyperscale cloud
computing. We find that cloud compute servers cause the majority of emissions in a general …

A cloud-scale characterization of remote procedure calls

K Seemakhupt, BE Stephens, S Khan, S Liu… - Proceedings of the 29th …, 2023 - dl.acm.org
The global scale and challenging requirements of modern cloud applications have led to the
development of complex, widely distributed, service-oriented applications. One enabler of …

Containerized microservices: A survey of resource management frameworks

LM Al Qassem, T Stouraitis, E Damiani… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
The growing adoption of microservice architectures (MSAs) has led to major research and
development efforts to address their challenges and improve their performance, reliability …

Graft: Efficient inference serving for hybrid deep learning with SLO guarantees via DNN re-alignment

J Wu, L Wang, Q **, F Liu - IEEE Transactions on Parallel and …, 2023 - ieeexplore.ieee.org
Deep neural networks (DNNs) have been widely adopted for various mobile inference tasks,
yet their ever-increasing computational demands are hindering their deployment on …

Adaptive QoS-aware microservice deployment with excessive loads via intra-and inter-datacenter scheduling

J Shi, K Fu, J Wang, Q Chen, D Zeng… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
User-facing applications often experience excessive loads and are shifting towards the
microservice architecture. To fully utilize heterogeneous resources, current datacenters have …

Understanding and optimizing workloads for unified resource management in large cloud platforms

C Lu, H Xu, K Ye, G Xu, L Zhang, G Yang… - Proceedings of the …, 2023 - dl.acm.org
To fully utilize computing resources, cloud providers such as Google and Alibaba choose to
co-locate online services with batch processing applications in their data centers. By …

Nodens: Enabling Resource Efficient and Fast {QoS} Recovery of Dynamic Microservice Applications in Datacenters

J Shi, H Zhang, Z Tong, Q Chen, K Fu… - 2023 USENIX Annual …, 2023 - usenix.org
Current microservice applications always meet with load and call graph dynamics. These
dynamics can easily lead to inappropriate resource allocation for microservices, and further …

Pert-gnn: Latency prediction for microservice-based cloud-native applications via graph neural networks

DSH Tam, Y Liu, H Xu, S **e, WC Lau - Proceedings of the 29th ACM …, 2023 - dl.acm.org
Cloud-native applications using microservice architectures are rapidly replacing traditional
monolithic applications. To meet end-to-end QoS guarantees and enhance user experience …

Integrating system state into spatio temporal graph neural network for microservice workload prediction

Y Luo, M Gao, Z Yu, H Ge, X Gao, T Cai… - Proceedings of the 30th …, 2024 - dl.acm.org
Microservice architecture has become a driving force in enhancing the modularity and
scalability of web applications, as evidenced by the Alipay platform's operational success …

Grunt attack: Exploiting execution dependencies in microservices

X Gu, Q Wang, J Liu, J Wei - 2024 54th Annual IEEE/IFIP …, 2024 - ieeexplore.ieee.org
Loosely-coupled and lightweight microservices running in containers are likely to form
complex execution dependencies inside the system. The execution dependency arises …