Lucid: A non-intrusive, scalable and interpretable scheduler for deep learning training jobs

Q Hu, M Zhang, P Sun, Y Wen, T Zhang - Proceedings of the 28th ACM …, 2023 - dl.acm.org
While recent deep learning workload schedulers exhibit excellent performance, it is arduous
to deploy them in practice due to some substantial defects, including inflexible intrusive …

AI-assisted framework for green-routing and load balancing in hybrid software-defined networking: Proposal, challenges and future perspective

R Etengu, SC Tan, LC Kwang, FM Abbou… - IEEE …, 2020 - ieeexplore.ieee.org
The explosive growth of IP networks, the advent of cloud computing, and the rapid progress
in wireless communications witnessed today reflect significant progress towards meeting the …

KubFBS: A fine‐grained and balance‐aware scheduling system for deep learning tasks based on kubernetes

Z Liu, C Chen, J Li, Y Cheng, Y Kou… - Concurrency and …, 2022 - Wiley Online Library
The past decade witnessed a remarkable increase in deep learning (DL) workloads which
require GPU resources to accelerate the training process. However, the existing coarse …

CE-NAS: An End-to-End Carbon-Efficient Neural Architecture Search Framework

Y Zhao, Y Liu, B Jiang, T Guo - arxiv preprint arxiv:2406.01414, 2024 - arxiv.org
This work presents a novel approach to neural architecture search (NAS) that aims to
increase carbon efficiency for the model design process. The proposed framework CE-NAS …

Dynamic k-means clustering of workload and cloud resource configuration for cloud elastic model

T Daradkeh, A Agarwal, M Zaman, N Goel - IEEE Access, 2020 - ieeexplore.ieee.org
Cloud elasticity involves timely provisioning and de-provisioning of computing resources
and adjusting resources size to meet the dynamic workload demand. This requires fast, and …

Building efficient and practical machine learning systems

Q Hu - 2023 - dr.ntu.edu.sg
With the widespread adoption of deep learning (DL) applications in recent years, training DL
models has become increasingly prevalent. Nevertheless, training these models is typically …

Research on Edge-Computing for Independent task assignment based on deep reinforcement learning

J Kan, X Zhou - 2022 10th International Conference on …, 2022 - ieeexplore.ieee.org
Edge computing is an important group test part in the modern Internet of Things architecture.
Due to the uncertainty of the environment, the assignment algorithm of computing task …

[PDF][PDF] Deep Reinforcement Learning Based Weather Monitoring Systemusing Arduino for Smart Environment

RS Vignesh, A Sivakumar, M Shyam… - International Journal of …, 2019 - researchgate.net
Weather forecasting is an essential predictive challenge that has depended primarily on
model-based methods. Collection of data about the different weather parameters is needed …

[PDF][PDF] Dynamic K-Means Clustering of Workload and Cloud Resource Configuration for Cloud Elastic Model

M ZAMAN, N GOEL - academia.edu
Cloud elasticity involves timely provisioning and de-provisioning of computing resources
and adjusting resources size to meet the dynamic workload demand. This requires fast, and …

An Optimized Deep Machine Learning and Micro-Services Architecture based Proactive Elastic Cloud Framework

T Daradkeh - 2021 - spectrum.library.concordia.ca
To achieve elasticity in cloud environment a holistic solution must be considered that
measures all running applications and resources performance, including its cloud …