Task scheduling in big data platforms: a systematic literature review

M Soualhia, F Khomh, S Tahar - Journal of Systems and Software, 2017 - Elsevier
Abstract Context: Hadoop, Spark, Storm, and Mesos are very well known frameworks in both
research and industrial communities that allow expressing and processing distributed …

Reservation-based scheduling: If you're late don't blame us!

C Curino, DE Difallah, C Douglas, S Krishnan… - Proceedings of the …, 2014 - dl.acm.org
The continuous shift towards data-driven approaches to business, and a growing attention to
improving return on investments (ROI) for cluster infrastructures is generating new …

Improving performance of heterogeneous mapreduce clusters with adaptive task tuning

D Cheng, J Rao, Y Guo, C Jiang… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
Datacenter-scale clusters are evolving toward heterogeneous hardware architectures due to
continuous server replacement. Meanwhile, datacenters are commonly shared by many …

Preemptive, low latency datacenter scheduling via lightweight virtualization

W Chen, J Rao, X Zhou - 2017 USENIX Annual Technical Conference …, 2017 - usenix.org
Data centers are evolving to host heterogeneous workloads on shared clusters to reduce the
operational cost and achieve higher resource utilization. However, it is challenging to …

Energy efficiency aware task assignment with dvfs in heterogeneous hadoop clusters

D Cheng, X Zhou, P Lama, M Ji… - Ieee transactions on …, 2017 - ieeexplore.ieee.org
While Hadoop ecosystems become increasingly important for practitioners of large-scale
data analysis, they also incur tremendous energy cost. This trend is driving up the need for …

Resource and deadline-aware job scheduling in dynamic hadoop clusters

D Cheng, J Rao, C Jiang, X Zhou - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
As Hadoop is becoming increasingly popular in large-scale data analysis, there is a growing
need for providing predictable services to users who have strict requirements on job …

Improving mapreduce performance in heterogeneous environments with adaptive task tuning

D Cheng, J Rao, Y Guo, X Zhou - Proceedings of the 15th International …, 2014 - dl.acm.org
The deployment of MapReduce in datacenters and clouds present several challenges in
achieving good job performance. Compared to in-house dedicated clusters, datacenters and …

MapReduce scheduling for deadline-constrained jobs in heterogeneous cloud computing systems

CH Chen, JW Lin, SY Kuo - IEEE Transactions on Cloud …, 2015 - ieeexplore.ieee.org
MapReduce is a software framework for processing data-intensive applications with a
parallel manner in cloud computing systems. Some MapReduce jobs have the deadline …

Predicting and mitigating jobs failures in big data clusters

A Rosa, LY Chen, W Binder - 2015 15th IEEE/ACM …, 2015 - ieeexplore.ieee.org
In large-scale data enters, software and hardware failures are frequent, resulting in failures
of job executions that may cause significant resource waste and performance deterioration …

Woha: Deadline-aware map-reduce workflow scheduling framework over hadoop clusters

S Li, S Hu, S Wang, L Su, T Abdelzaher… - 2014 IEEE 34th …, 2014 - ieeexplore.ieee.org
In this paper, we present WOHA, an efficient scheduling framework for deadline-aware Map-
Reduce workflows. In data centers, complex backend data analysis often utilizes a workflow …