SQL and NoSQL database software architecture performance analysis and assessments—a systematic literature review

W Khan, T Kumar, C Zhang, K Raj, AM Roy… - Big Data and Cognitive …, 2023 - mdpi.com
The competent software architecture plays a crucial role in the difficult task of big data
processing for SQL and NoSQL databases. SQL databases were created to organize data …

The family of mapreduce and large-scale data processing systems

S Sakr, A Liu, AG Fayoumi - ACM Computing Surveys (CSUR), 2013 - dl.acm.org
In the last two decades, the continuous increase of computational power has produced an
overwhelming flow of data which has called for a paradigm shift in the computing …

Parallel data processing with MapReduce: a survey

KH Lee, YJ Lee, H Choi, YD Chung, B Moon - AcM sIGMoD record, 2012 - dl.acm.org
A prominent parallel data processing tool MapReduce is gaining significant momentum from
both industry and academia as the volume of data to analyze grows rapidly. While …

[PDF][PDF] Starfish: A self-tuning system for big data analytics.

H Herodotou, H Lim, G Luo, N Borisov, L Dong… - Cidr, 2011 - cse.fau.edu
Timely and cost-effective analytics over “Big Data” is now a key ingredient for success in
many businesses, scientific and engineering disciplines, and government endeavors. The …

Big data processing in cloud computing environments

C Ji, Y Li, W Qiu, U Awada, K Li - 2012 12th international …, 2012 - ieeexplore.ieee.org
With the rapid growth of emerging applications like social network analysis, semantic Web
analysis and bioinformatics network analysis, a variety of data to be processed continues to …

Profiling, what-if analysis, and cost-based optimization of mapreduce programs

H Herodotou, S Babu - Proceedings of the VLDB Endowment, 2011 - dl.acm.org
MapReduce has emerged as a viable competitor to database systems in big data analytics.
MapReduce programs are being written for a wide variety of application domains including …

A survey of large-scale analytical query processing in MapReduce

C Doulkeridis, K Nørvåg - The VLDB journal, 2014 - Springer
Enterprises today acquire vast volumes of data from different sources and leverage this
information by means of data analysis to support effective decision-making and provide new …

Hyracks: A flexible and extensible foundation for data-intensive computing

V Borkar, M Carey, R Grover, N Onose… - 2011 IEEE 27th …, 2011 - ieeexplore.ieee.org
Hyracks is a new partitioned-parallel software platform designed to run data-intensive
computations on large shared-nothing clusters of computers. Hyracks allows users to …

Distributed data management using MapReduce

F Li, BC Ooi, MT Özsu, S Wu - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
MapReduce is a framework for processing and managing large-scale datasets in a
distributed cluster, which has been used for applications such as generating search indexes …

A comprehensive view of Hadoop research—A systematic literature review

I Polato, R Ré, A Goldman, F Kon - Journal of Network and Computer …, 2014 - Elsevier
Context: In recent years, the valuable knowledge that can be retrieved from petabyte scale
datasets–known as Big Data–led to the development of solutions to process information …