A taxonomy and survey on green data center networks
Data centers are growing exponentially (in number and size) to accommodate the escalating
user and application demands. Likewise, the concerns about the environmental impacts …
user and application demands. Likewise, the concerns about the environmental impacts …
Big data resource management & networks: Taxonomy, survey, and future directions
Big Data (BD) platforms have a long tradition of leveraging trends and technologies from the
broader computer network and communication community. For several years, dedicated …
broader computer network and communication community. For several years, dedicated …
T-storm: Traffic-aware online scheduling in storm
Storm has emerged as a promising computation platform for stream data processing. In this
paper, we first show inefficiencies of the current practice of Storm scheduling and challenges …
paper, we first show inefficiencies of the current practice of Storm scheduling and challenges …
Network-aware scheduling for data-parallel jobs: Plan when you can
To reduce the impact of network congestion on big data jobs, cluster management
frameworks use various heuristics to schedule compute tasks and/or network flows. Most of …
frameworks use various heuristics to schedule compute tasks and/or network flows. Most of …
Investigation of data locality in mapreduce
Traditional HPC architectures separate compute nodes and storage nodes, which are
interconnected with high speed links to satisfy data access requirements in multi-user …
interconnected with high speed links to satisfy data access requirements in multi-user …
{ShuffleWatcher}: Shuffle-aware scheduling in multi-tenant {MapReduce} clusters
MapReduce clusters are usually multi-tenant (ie, shared among multiple users and jobs) for
improving cost and utilization. The performance of jobs in a multi-tenant MapReduce cluster …
improving cost and utilization. The performance of jobs in a multi-tenant MapReduce cluster …
[HTML][HTML] Optimizing VM allocation and data placement for data-intensive applications in cloud using ACO metaheuristic algorithm
Nowadays data-intensive applications for processing big data are being hosted in the cloud.
Since the cloud environment provides virtualized resources for computation, and data …
Since the cloud environment provides virtualized resources for computation, and data …
Energy-aware scheduling of mapreduce jobs for big data applications
The majority of large-scale data intensive applications executed by data centers are based
on MapReduce or its open-source implementation, Hadoop. Such applications are executed …
on MapReduce or its open-source implementation, Hadoop. Such applications are executed …
Interference and locality-aware task scheduling for MapReduce applications in virtual clusters
MapReduce emerges as an important distributed programming paradigm for large-scale
applications. Running MapReduce applications in clouds presents an attractive usage …
applications. Running MapReduce applications in clouds presents an attractive usage …
Encoded bitmap indexing for data warehouses
Complex query types, huge data volumes, and very high read/update ratios make the
indexing techniques designed and tuned for traditional database systems unsuitable for …
indexing techniques designed and tuned for traditional database systems unsuitable for …