A survey on automatic parameter tuning for big data processing systems

H Herodotou, Y Chen, J Lu - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
Big data processing systems (eg, Hadoop, Spark, Storm) contain a vast number of
configuration parameters controlling parallelism, I/O behavior, memory settings, and …

MapReduce scheduling algorithms in Hadoop: a systematic study

S Hedayati, N Maleki, T Olsson, F Ahlgren… - Journal of Cloud …, 2023 - Springer
Hadoop is a framework for storing and processing huge volumes of data on clusters. It uses
Hadoop Distributed File System (HDFS) for storing data and uses MapReduce to process …

IoTDeM: An IoT Big Data-oriented MapReduce performance prediction extended model in multiple edge clouds

Z Lu, N Wang, J Wu, M Qiu - Journal of Parallel and Distributed Computing, 2018 - Elsevier
Abstract Uploading all IoT Big Data to a centralized cloud for data analytics is infeasible
because of the excessive latency and bandwidth limitation of the Internet. A promising …

Hadoop performance modeling for job estimation and resource provisioning

M Khan, Y **, M Li, Y **ang… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
MapReduce has become a major computing model for data intensive applications. Hadoop,
an open source implementation of MapReduce, has been adopted by an increasingly …

Memory-aware resource management algorithm for low-energy cloud data centers

B Liang, X Dong, Y Wang, X Zhang - Future Generation Computer Systems, 2020 - Elsevier
The continuous advancement of cloud computing technology has driven the vigorous
development of cloud data centers. This manifests itself not only in increasing numbers, but …

Performance prediction of cloud-based big data applications

D Ardagna, E Barbierato, A Evangelinou… - Proceedings of the …, 2018 - dl.acm.org
Data heterogeneity and irregularity are key characteristics of big data applications that often
overwhelm the existing software and hardware infrastructures. In such context, the exibility …

-Simplexed: Adaptive Delaunay Triangulation for Performance Modeling and Prediction on Big Data Analytics

Y Chen, P Goetsch, MA Hoque, J Lu… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Big Data processing systems (eg, Spark) have a number of resource configuration
parameters, such as memory size, CPU allocation, and the number of running nodes …

A swarm intelligent method for traffic light scheduling: application to real urban traffic networks

W Hu, H Wang, L Yan, B Du - Applied Intelligence, 2016 - Springer
Traffic lights play an important role nowadays for solving complex and serious urban traffic
problems. How to optimize the schedule of hundreds of traffic lights has become a …

A novel configuration tuning method based on feature selection for Hadoop MapReduce

J Liu, S Tang, G Xu, C Ma, M Lin - IEEE Access, 2020 - ieeexplore.ieee.org
Configuration parameter optimization is an important means of improving the performance of
the MapReduce model. The existing parameter tuning methods usually optimize all …

Ad-hoc cloudlet based cooperative cloud gaming

F Chi, X Wang, W Cai… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
As the game industry matures, processing complex game logics in a timely manner is no
longer an insurmountable problem. However, current cloud-based mobile gaming solutions …