A survey on data storage and placement methodologies for cloud-big data ecosystem

S Mazumdar, D Seybold, K Kritikos, Y Verginadis - Journal of Big Data, 2019 - Springer
Currently, the data to be explored and exploited by computing systems increases at an
exponential rate. The massive amount of data or so-called “Big Data” put pressure on …

Big data resource management & networks: Taxonomy, survey, and future directions

FM Awaysheh, M Alazab, S Garg… - … Surveys & Tutorials, 2021 - ieeexplore.ieee.org
Big Data (BD) platforms have a long tradition of leveraging trends and technologies from the
broader computer network and communication community. For several years, dedicated …

A data placement strategy in scientific cloud workflows

D Yuan, Y Yang, X Liu, J Chen - Future Generation Computer Systems, 2010 - Elsevier
In scientific cloud workflows, large amounts of application data need to be stored in
distributed data centres. To effectively store these data, a data manager must intelligently …

MOON: MapReduce on opportunistic environments

H Lin, X Ma, J Archuleta, W Feng, M Gardner… - Proceedings of the 19th …, 2010 - dl.acm.org
MapReduce offers an ease-of-use programming paradigm for processing large data sets,
making it an attractive model for distributed volunteer computing systems. However, unlike …

A genetic algorithm based data replica placement strategy for scientific applications in clouds

L Cui, J Zhang, L Yue, Y Shi, H Li… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Cloud computing is a promising distributed computing platform for big data applications, e.g.,
scientific applications, since excessive resources can be obtained from cloud services for …

Data placement in era of cloud computing: a survey, taxonomy and open research issues

A Kaur, P Gupta, M Singh, A Nayyar - Scalable Computing: Practice and …, 2019 - scpe.org
In cloud computing, data placement is a critical operation performed as part of workflow
management and aims to find the best physical machine to place the data. It has direct …

BitDew: A data management and distribution service with multi-protocol file transfer and metadata abstraction

G Fedak, H He, F Cappello - Journal of network and computer applications, 2009 - Elsevier
Desktop Grids use the computing, network and storage resources from idle desktop PCs
distributed over multiple-LANs or the Internet to compute a large variety of resource …

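Data placement strategy and method for data-intensive applications in cloud computing environments

P Zheng, L Cui, H Wang, M Xu - Chinese Journal of Computers (计算机学报), 2010 - cjc.ict.ac.cn
Abstract: Workflow-oriented data-intensive applications in cloud computing environments have been widely
adopted in many fields. In cloud environments with multiple data centers, such applications face new challenges in data placement, chiefly how to reduce cross-data-center data …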

A data and task co-scheduling algorithm for scientific cloud workflows

K Deng, K Ren, M Zhu, J Song - IEEE Transactions on Cloud …, 2015 - ieeexplore.ieee.org
Cloud computing has emerged as a promising computational infrastructure for cost-efficient
workflow execution by provisioning on-demand resources in a pay-as-you-go manner. While …

DCCP: an effective data placement strategy for data-intensive computations in distributed cloud computing systems

T Wang, S Yao, Z Xu, S Jia - The Journal of Supercomputing, 2016 - Springer
Cloud computing systems provide high-performance computing resources and distributed
storage space to deal with data-intensive computations. Data scheduling between data …