Learning scheduling algorithms for data processing clusters
Efficiently scheduling data processing jobs on distributed compute clusters requires complex
algorithms. Current systems use simple, generalized heuristics and ignore workload …
algorithms. Current systems use simple, generalized heuristics and ignore workload …
Open problems in queueing theory inspired by datacenter computing
M Harchol-Balter - Queueing Systems, 2021 - Springer
Datacenter operations today provide a plethora of new queueing and scheduling problems.
The notion of a “job” has become more general and multi-dimensional. The ways in which …
The notion of a “job” has become more general and multi-dimensional. The ways in which …
Online Scheduling via Gradient Descent for Weighted Flow Time Minimization
In this paper, we explore how a natural generalization of Shortest Remaining Processing
Time (SRPT) can be a powerful meta-algorithm for online scheduling. The meta-algorithm …
Time (SRPT) can be a powerful meta-algorithm for online scheduling. The meta-algorithm …
Optimizing job offloading schedule for collaborative DNN inference
Deep Neural Networks (DNNs) have been widely deployed in mobile applications. DNN
inference latency is a critical metric to measure the service quality of those applications …
inference latency is a critical metric to measure the service quality of those applications …
Adaptive scheduling of multiprogrammed dynamic-multithreading applications
Modern parallel platforms, such as clouds or servers, are often shared among many different
jobs. However, existing parallel programming runtime systems are designed and optimized …
jobs. However, existing parallel programming runtime systems are designed and optimized …
Energy-efficient scheduling and routing via randomized rounding
We propose a unifying framework based on configuration linear programs and randomized
rounding, for different energy optimization problems in the dynamic speed-scaling setting …
rounding, for different energy optimization problems in the dynamic speed-scaling setting …
Optimal resource allocation for elastic and inelastic jobs
Modern data centers are tasked with processing heterogeneous workloads consisting of
various classes of jobs. These classes differ in their arrival rates, size distributions, and job …
various classes of jobs. These classes differ in their arrival rates, size distributions, and job …
Cloud computing value chains: Research from the operations management perspective
Problem definition: Cloud computing is recognized as a critical driver of information
technology–enabled innovations. The operations management (OM) community, however …
technology–enabled innovations. The operations management (OM) community, however …
DAG-aware harmonizing job scheduling and data caching for disaggregated analytics frameworks
Y Tong, J Liu, H Wang, M He, K Zhou, R He… - Future Generation …, 2024 - Elsevier
Modern data analytics frameworks often integrate with external storage services, which can
lead to storage bottlenecks. Existing caching and prefetching solutions utilize high-level …
lead to storage bottlenecks. Existing caching and prefetching solutions utilize high-level …
Hierarchy-based algorithms for minimizing makespan under precedence and communication constraints
We consider the classic problem of scheduling jobs with precedence constraints on a set of
identical machines to minimize the makespan objective function. Understanding the exact …
identical machines to minimize the makespan objective function. Understanding the exact …