Designing cloud servers for lower carbon
To mitigate climate change, we must reduce carbon emissions from hyperscale cloud
computing. We find that cloud compute servers cause the majority of emissions in a general …
computing. We find that cloud compute servers cause the majority of emissions in a general …
A cloud-scale characterization of remote procedure calls
The global scale and challenging requirements of modern cloud applications have led to the
development of complex, widely distributed, service-oriented applications. One enabler of …
development of complex, widely distributed, service-oriented applications. One enabler of …
Containerized microservices: A survey of resource management frameworks
The growing adoption of microservice architectures (MSAs) has led to major research and
development efforts to address their challenges and improve their performance, reliability …
development efforts to address their challenges and improve their performance, reliability …
Graft: Efficient inference serving for hybrid deep learning with SLO guarantees via DNN re-alignment
Deep neural networks (DNNs) have been widely adopted for various mobile inference tasks,
yet their ever-increasing computational demands are hindering their deployment on …
yet their ever-increasing computational demands are hindering their deployment on …
Adaptive QoS-aware microservice deployment with excessive loads via intra-and inter-datacenter scheduling
User-facing applications often experience excessive loads and are shifting towards the
microservice architecture. To fully utilize heterogeneous resources, current datacenters have …
microservice architecture. To fully utilize heterogeneous resources, current datacenters have …
Understanding and optimizing workloads for unified resource management in large cloud platforms
To fully utilize computing resources, cloud providers such as Google and Alibaba choose to
co-locate online services with batch processing applications in their data centers. By …
co-locate online services with batch processing applications in their data centers. By …
Nodens: Enabling Resource Efficient and Fast {QoS} Recovery of Dynamic Microservice Applications in Datacenters
Current microservice applications always meet with load and call graph dynamics. These
dynamics can easily lead to inappropriate resource allocation for microservices, and further …
dynamics can easily lead to inappropriate resource allocation for microservices, and further …
Pert-gnn: Latency prediction for microservice-based cloud-native applications via graph neural networks
Cloud-native applications using microservice architectures are rapidly replacing traditional
monolithic applications. To meet end-to-end QoS guarantees and enhance user experience …
monolithic applications. To meet end-to-end QoS guarantees and enhance user experience …
Integrating system state into spatio temporal graph neural network for microservice workload prediction
Microservice architecture has become a driving force in enhancing the modularity and
scalability of web applications, as evidenced by the Alipay platform's operational success …
scalability of web applications, as evidenced by the Alipay platform's operational success …
Grunt attack: Exploiting execution dependencies in microservices
Loosely-coupled and lightweight microservices running in containers are likely to form
complex execution dependencies inside the system. The execution dependency arises …
complex execution dependencies inside the system. The execution dependency arises …