G-Hadoop: MapReduce across distributed data centers for data-intensive computing
Recently, the computational requirements for large-scale data-intensive analysis of scientific
data have grown significantly. In High Energy Physics (HEP) for example, the Large Hadron …
[BOOK] Essentials of cloud computing
K Chandrasekaran - 2014 - books.google.com
Cloud computing—accessing computing resources over the Internet—is rapidly changing
the landscape of information technology. Its primary benefits compared to on-premise …
An overview of the open science data cloud
RL Grossman, Y Gu, J Mambretti, M Sabala… - Proceedings of the 19th …, 2010 - dl.acm.org
The Open Science Data Cloud is a distributed cloud based infrastructure for managing,
analyzing, archiving and sharing scientific datasets. We introduce the Open Science Data …
Data-intensive cloud computing: requirements, expectations, challenges, and solutions
Data-intensive systems encompass terabytes to petabytes of data. Such systems require
massive storage and intensive computational power in order to execute complex queries …
An improved partitioning mechanism for optimizing massive data analysis using MapReduce
In the era of Big Data, huge amounts of structured and unstructured data are being produced
daily by a myriad of ubiquitous sources. Big Data is difficult to work with and requires …
Data and task parallelism in ILP using MapReduce
A Srinivasan, TA Faruquie, S Joshi - Machine learning, 2012 - Springer
Nearly two decades of research in the area of Inductive Logic Programming (ILP) have seen
steady progress in clarifying its theoretical foundations and regular demonstrations of its …
[PDF] Data-Intensive Computing on Grid Computing Environment
P Raina, H Shah - International Journal of Open Publication and … - researchgate.net
Grid computing raises challenging issues in many areas of computer science,
bioinformatics, high energy physics and especially in the area of distributed computing, as …
An adaptive and memory efficient sampling mechanism for partitioning in MapReduce
Big Data refers to the massive amounts of structured and unstructured data being produced
every day from a wide range of sources. Big Data is difficult to work with and needs a large …
Security in data intensive computing systems
EB Fernandez - Handbook of Data Intensive Computing, 2011 - Springer
Many applications, e.g., scientific computing, weather prediction, medical image processing,
require the manipulation of large amounts of data. Analysis of web traffic, sales, travel, and …
Understanding scientific applications for cloud environments
Distributed systems and their specific incarnations have evolved significantly over the years.
Most often, these evolutionary steps have been a consequence of external technology …