A taxonomy and survey on green data center networks

K Bilal, SUR Malik, O Khalid, A Hameed… - Future Generation …, 2014 - Elsevier
Data centers are growing exponentially (in number and size) to accommodate the escalating
user and application demands. Likewise, the concerns about the environmental impacts …

Big data resource management & networks: Taxonomy, survey, and future directions

FM Awaysheh, M Alazab, S Garg… - … Surveys & Tutorials, 2021 - ieeexplore.ieee.org
Big Data (BD) platforms have a long tradition of leveraging trends and technologies from the
broader computer network and communication community. For several years, dedicated …

T-storm: Traffic-aware online scheduling in storm

J Xu, Z Chen, J Tang, S Su - 2014 IEEE 34th International …, 2014 - ieeexplore.ieee.org
Storm has emerged as a promising computation platform for stream data processing. In this
paper, we first show inefficiencies of the current practice of Storm scheduling and challenges …

Network-aware scheduling for data-parallel jobs: Plan when you can

V Jalaparti, P Bodik, I Menache, S Rao… - ACM SIGCOMM …, 2015 - dl.acm.org
To reduce the impact of network congestion on big data jobs, cluster management
frameworks use various heuristics to schedule compute tasks and/or network flows. Most of …

Investigation of data locality in mapreduce

Z Guo, G Fox, M Zhou - … Symposium on Cluster, Cloud and Grid …, 2012 - ieeexplore.ieee.org
Traditional HPC architectures separate compute nodes and storage nodes, which are
interconnected with high speed links to satisfy data access requirements in multi-user …

{ShuffleWatcher}: Shuffle-aware scheduling in multi-tenant {MapReduce} clusters

F Ahmad, ST Chakradhar, A Raghunathan… - 2014 USENIX Annual …, 2014 - usenix.org
MapReduce clusters are usually multi-tenant (ie, shared among multiple users and jobs) for
improving cost and utilization. The performance of jobs in a multi-tenant MapReduce cluster …

[HTML][HTML] Optimizing VM allocation and data placement for data-intensive applications in cloud using ACO metaheuristic algorithm

TP Shabeera, SDM Kumar, SM Salam… - Engineering Science and …, 2017 - Elsevier
Nowadays data-intensive applications for processing big data are being hosted in the cloud.
Since the cloud environment provides virtualized resources for computation, and data …

Energy-aware scheduling of mapreduce jobs for big data applications

L Mashayekhy, MM Nejad, D Grosu… - IEEE transactions on …, 2014 - ieeexplore.ieee.org
The majority of large-scale data intensive applications executed by data centers are based
on MapReduce or its open-source implementation, Hadoop. Such applications are executed …

Interference and locality-aware task scheduling for MapReduce applications in virtual clusters

X Bu, J Rao, C Xu - Proceedings of the 22nd international symposium …, 2013 - dl.acm.org
MapReduce emerges as an important distributed programming paradigm for large-scale
applications. Running MapReduce applications in clouds presents an attractive usage …

Encoded bitmap indexing for data warehouses

MC Wu, AP Buchmann - Proceedings 14th International …, 1998 - ieeexplore.ieee.org
Complex query types, huge data volumes, and very high read/update ratios make the
indexing techniques designed and tuned for traditional database systems unsuitable for …