Machine learning techniques in emerging cloud computing integrated paradigms: A survey and taxonomy

D Soni, N Kumar - Journal of Network and Computer Applications, 2022 - Elsevier
Cloud computing offers Infrastructure as a Service (IaaS), Platform as a Service (PaaS), and
Software as a Service (SaaS) to provide compute, network, and storage capabilities to the …

Elasticity in cloud computing: state of the art and research challenges

Y Al-Dhuraibi, F Paraiso, N Djarallah… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Elasticity is a fundamental property in cloud computing that has recently witnessed major
developments. This article reviews both classical and recent elasticity solutions and …

{MArk}: Exploiting cloud services for {Cost-Effective},{SLO-Aware} machine learning inference serving

C Zhang, M Yu, W Wang, F Yan - 2019 USENIX Annual Technical …, 2019 - usenix.org
The advances of Machine Learning (ML) have sparked a growing demand of ML-as-a-
Service: developers train ML models and publish them in the cloud as online services to …

A review of auto-scaling techniques for elastic applications in cloud environments

T Lorido-Botran, J Miguel-Alonso, JA Lozano - Journal of grid computing, 2014 - Springer
Cloud computing environments allow customers to dynamically scale their applications. The
key problem is how to lease the right amount of resources, on a pay-as-you-go basis …

Auto-scaling web applications in clouds: A taxonomy and survey

C Qu, RN Calheiros, R Buyya - ACM Computing Surveys (CSUR), 2018 - dl.acm.org
Web application providers have been migrating their applications to cloud data centers,
attracted by the emerging cloud computing paradigm. One of the appealing features of the …

A hierarchical framework of cloud resource allocation and power management using deep reinforcement learning

N Liu, Z Li, J Xu, Z Xu, S Lin, Q Qiu… - 2017 IEEE 37th …, 2017 - ieeexplore.ieee.org
Automatic decision-making approaches, such as reinforcement learning (RL), have been
applied to (partially) solve the resource allocation problem adaptively in the cloud computing …

Machine learning-based scaling management for kubernetes edge clusters

L Toka, G Dobreff, B Fodor… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Kubernetes, the container orchestrator for cloud-deployed applications, offers automatic
scaling for the application provider in order to meet the ever-changing intensity of …

Deep and reinforcement learning for automated task scheduling in large‐scale cloud computing systems

G Rjoub, J Bentahar, O Abdel Wahab… - Concurrency and …, 2021 - Wiley Online Library
Cloud computing is undeniably becoming the main computing and storage platform for
today's major workloads. From Internet of things and Industry 4.0 workloads to big data …

A load balancing and optimization strategy (LBOS) using reinforcement learning in fog computing environment

FM Talaat, MS Saraya, AI Saleh, HA Ali… - Journal of Ambient …, 2020 - Springer
Fog computing (FC) can be considered as a computing paradigm which performs Internet of
Things (IoT) applications at the edge of the network. Recently, there is a great growth of data …

Intelligent autoscaling of microservices in the cloud for real-time applications

AA Khaleq, I Ra - IEEE access, 2021 - ieeexplore.ieee.org
Cloud applications are becoming more containerized in nature. Develo** a cloud
application based on a microservice architecture imposes different challenges including …