Morpheus: Towards automated SLOs for enterprise clusters

SA Jyothi, C Curino, I Menache… - … USENIX symposium on …, 2016 - usenix.org
Modern resource management frameworks for large-scale analytics leave unresolved the
problematic tension between high cluster utilization and job's performance predictability …

Online job scheduling in distributed machine learning clusters

Y Bao, Y Peng, C Wu, Z Li - IEEE INFOCOM 2018-IEEE …, 2018 - ieeexplore.ieee.org
Nowadays large-scale distributed machine learning systems have been deployed to support
various analytics and intelligence services in IT firms. To train a large dataset and derive the …

Reservation-based scheduling: If you're late don't blame us!

C Curino, DE Difallah, C Douglas, S Krishnan… - Proceedings of the …, 2014 - dl.acm.org
The continuous shift towards data-driven approaches to business, and a growing attention to
improving return on investments (ROI) for cluster infrastructures is generating new …

Competitive algorithms for the online multiple knapsack problem with application to electric vehicle charging

B Sun, A Zeynali, T Li, M Hajiesmaili… - Proceedings of the …, 2020 - dl.acm.org
We introduce and study a general version of the fractional online knapsack problem with
multiple knapsacks, heterogeneous constraints on which items can be assigned to which …

An efficient cloud market mechanism for computing jobs with soft deadlines

R Zhou, Z Li, C Wu, Z Huang - IEEE/ACM Transactions on …, 2016 - ieeexplore.ieee.org
This paper studies the cloud market for computing jobs with completion deadlines, and
designs efficient online auctions for cloud resource provisioning. A cloud user bids for future …

Optimal online data partitioning for geo-distributed machine learning in edge of wireless networks

X Lyu, C Ren, W Ni, H Tian, RP Liu… - IEEE Journal on …, 2019 - ieeexplore.ieee.org
To enable machine learning at the edge of wireless networks (such as edge cloud), close to
mobile users, is critical for future wireless networks, but challenging since the lower layers in …

Online EV charging scheduling with on-arrival commitment

B Alinia, MH Hajiesmaili… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
The rapid proliferation of electric vehicles has resulted in a drastic increase in the total
energy demand of EVs. Given the limited charging rate capacity of charging stations and …

Edge-LLM: A collaborative framework for large language model serving in edge computing

F Cai, D Yuan, Z Yang, L Cui - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
The rapid advancement and extensive implementation of Large Language Models (LLMs)
are milestones in the realm of artificial intelligence. Although Parameter-Efficient Transfer …

Near-optimal scheduling mechanisms for deadline-sensitive jobs in large computing clusters

N Jain, I Menache, J Naor, J Yaniv - ACM Transactions on Parallel …, 2015 - dl.acm.org
We consider a market-based resource allocation model for batch jobs in cloud computing
clusters. In our model, we incorporate the importance of the due date of a job rather than the …

Online virtual machine allocation with lifetime and load predictions

N Buchbinder, Y Fairstein, K Mellou… - ACM SIGMETRICS …, 2021 - dl.acm.org
The cloud computing industry has grown rapidly over the last decade, and with this growth
there is a significant increase in demand for compute resources. Demand is manifested in …