Performance and energy efficiency of big data applications in cloud environments: A Hadoop case study

E Feller, L Ramakrishnan, C Morin - Journal of Parallel and Distributed …, 2015 - Elsevier
The exponential growth of scientific and business data has resulted in the evolution of the
cloud computing environments and the MapReduce parallel programming model. The focus …

An evaluation of cassandra for hadoop

E Dede, B Sendir, P Kuzlu, J Hartog… - 2013 IEEE Sixth …, 2013 - ieeexplore.ieee.org
In the last decade, the increased use and growth of social media, unconventional web
technologies, and mobile applications, have all encouraged development of a new breed of …

BDEv 3.0: energy efficiency and microarchitectural characterization of Big Data processing frameworks

J Veiga, J Enes, RR Expósito, J Tourino - Future Generation Computer …, 2018 - Elsevier
As the size of Big Data workloads keeps increasing, the evaluation of distributed frameworks
becomes a crucial task in order to identify potential performance bottlenecks that may delay …

Processing cassandra datasets with hadoop-streaming based approaches

E Dede, B Sendir, P Kuzlu, J Weachock… - IEEE transactions on …, 2015 - ieeexplore.ieee.org
The progressive transition in the nature of both scientific and industrial datasets has been
the driving force behind the development and research interests in the NoSQL model …

Towards a comprehensive set of big data benchmarks

GC Fox, S Jha, J Qiu, S Ekanayake… - Big Data and High …, 2015 - ebooks.iospress.nl
This paper reviews the Ogre classification of Big Data application with 50 facets divided into
four groups or views. These four correspond to Problem Architecture, Execution mode, Data …

Marissa: Mapreduce implementation for streaming science applications

E Dede, Z Fadika, J Hartog… - 2012 IEEE 8th …, 2012 - ieeexplore.ieee.org
MapReduce has since its inception been steadily gaining ground in various scientific
disciplines ranging from space exploration to protein folding. The model poses a challenge …

On the performance and energy efficiency of Hadoop deployment models

E Feller, L Ramakrishnan… - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
The exponential growth of scientific and business data has resulted in the evolution of the
cloud computing and the MapReduce parallel programming model. Cloud computing …

Analysis and evaluation of MapReduce solutions on an HPC cluster

J Veiga, RR Expósito, GL Taboada, J Tourino - Computers & Electrical …, 2016 - Elsevier
The ever growing needs of Big Data applications are demanding challenging capabilities
which cannot be handled easily by traditional systems, and thus more and more …

MARIANE: Using MapReduce in HPC environments

Z Fadika, E Dede, M Govindaraju… - Future Generation …, 2014 - Elsevier
MapReduce is increasingly becoming a popular programming model. However, the widely
used implementation, Apache Hadoop, uses the Hadoop Distributed File System (HDFS) …

[PDF][PDF] Performance Evaluation of Big Data Analysis.

J Veiga, RR Expósito, J Touriño - 2019 - ghpc.udc.es
Evaluating the performance of Big Data systems is the usual way of getting information
about the expected execution time of analytics applications. These applications are …