A survey on the scheduling mechanisms in serverless computing: a taxonomy, challenges, and trends

M Ghorbian, M Ghobaei-Arani, L Esmaeili - Cluster Computing, 2024 - Springer
In recent years, serverless computing has received significant attention due to its innovative
approach to cloud computing. In this novel approach, a new payment model is presented …

Iris: interference and resource aware predictive orchestration for ml inference serving

A Ferikoglou, P Chrysomeris… - 2023 IEEE 16th …, 2023 - ieeexplore.ieee.org
Over the last years, the ever-growing number of Machine Learning (ML) and Artificial
Intelligence (AI) applications deployed in the Cloud has led to high demands on the …

SIRM: Cost efficient and SLO aware ML prediction on Fog-Cloud Network

C Phalak, D Chahal, R Singhal - 2023 15th International …, 2023 - ieeexplore.ieee.org
Cloud and Fog computing are complementary technologies used for complex Internet of
Things (IoT) based deployment of applications. With an increase in the number of internet …

Novel Contract-based Runtime Explainability Framework for End-to-End Ensemble Machine Learning Serving

MT Nguyen, HL Truong, T Truong-Huu - Proceedings of the IEEE/ACM …, 2024 - dl.acm.org
The growing complexity of end-to-end Machine Learning (ML) serving across the edge-
cloud continuum has raised the necessity for runtime explainability to support service …

PISeL: Pipelining DNN Inference for Serverless Computing

M Rahimi Jafari, J Su, Y Zhang, O Wang… - Proceedings of the 33rd …, 2024 - dl.acm.org
Serverless computing offers resource efficiency, cost efficiency, and a" pay-as-you-go"
pricing model, which makes it highly attractive to both users and cloud providers. However …

mSIRM: Cost-Efficient and SLO-aware ML Load Balancing on Fog and Multi-Cloud Network

C Phalak, D Chahal, M Ramesh… - … of the 13th Workshop on AI …, 2023 - dl.acm.org
The use of intelligent sensors and edge devices has grown exponentially for automation in
the industry to hyper-personalize applications, minimize cost, improve efficiency, and …

Optimizing Multiple Consumer-specific Objectives in End-to-End Ensemble Machine Learning Serving

T Nguyen, L Truong, P Arcaini… - IEEE/ACM International …, 2024 - research.aalto.fi
Optimizing the quality of machine learning (ML) services for individual consumers with
specific objectives is crucial for improving consumer satisfaction. In this context, end-to-end …

Machine Learning-Driven Strategies for Efficient Resource Management in Cloud Data Centers

D Mustafa - 2024 - spectrum.library.concordia.ca
Cloud computing is one of the major paradigms in the information technology industry,
offering diverse scalable on-demand services over the Internet. Nevertheless, managing …