SQL and NoSQL database software architecture performance analysis and assessments—a systematic literature review
The competent software architecture plays a crucial role in the difficult task of big data
processing for SQL and NoSQL databases. SQL databases were created to organize data …
The family of MapReduce and large-scale data processing systems
In the last two decades, the continuous increase of computational power has produced an
overwhelming flow of data which has called for a paradigm shift in the computing …
Parallel data processing with MapReduce: a survey
A prominent parallel data processing tool MapReduce is gaining significant momentum from
both industry and academia as the volume of data to analyze grows rapidly. While …
Starfish: A self-tuning system for big data analytics.
Timely and cost-effective analytics over “Big Data” is now a key ingredient for success in
many businesses, scientific and engineering disciplines, and government endeavors. The …
Big data processing in cloud computing environments
With the rapid growth of emerging applications like social network analysis, semantic Web
analysis and bioinformatics network analysis, a variety of data to be processed continues to …
Profiling, what-if analysis, and cost-based optimization of MapReduce programs
MapReduce has emerged as a viable competitor to database systems in big data analytics.
MapReduce programs are being written for a wide variety of application domains including …
A survey of large-scale analytical query processing in MapReduce
Enterprises today acquire vast volumes of data from different sources and leverage this
information by means of data analysis to support effective decision-making and provide new …
Hyracks: A flexible and extensible foundation for data-intensive computing
V Borkar, M Carey, R Grover, N Onose… - 2011 IEEE 27th …, 2011 - ieeexplore.ieee.org
Hyracks is a new partitioned-parallel software platform designed to run data-intensive
computations on large shared-nothing clusters of computers. Hyracks allows users to …
Distributed data management using MapReduce
MapReduce is a framework for processing and managing large-scale datasets in a
distributed cluster, which has been used for applications such as generating search indexes …
A comprehensive view of Hadoop research—A systematic literature review
Context: In recent years, the valuable knowledge that can be retrieved from petabyte scale
datasets–known as Big Data–led to the development of solutions to process information …