State-of-the-art of artificial intelligence and big data analytics reviews in five different domains: a bibliometric summary
Academicians and practitioners have recently begun to accord Artificial Intelligence (AI) and
Big Data Analytics (BDA) significant consideration when exploring emerging research trends …
Big Data Analytics (BDA) significant consideration when exploring emerging research trends …
Improving MapReduce performance using smart speculative execution strategy
MapReduce is a widely used parallel computing framework for large scale data processing.
The two major performance metrics in MapReduce are job execution time and cluster …
The two major performance metrics in MapReduce are job execution time and cluster …
BlobSeer: Next-generation data management for large scale infrastructures
As data volumes increase at a high speed in more and more application fields of science,
engineering, information services, etc., the challenges posed by data-intensive computing …
engineering, information services, etc., the challenges posed by data-intensive computing …
Predicting software anomalies using machine learning techniques
In this paper, we present a detailed evaluation of a set of well-known Machine Learning
classifiers in front of dynamic and non-deterministic software anomalies. The system state …
classifiers in front of dynamic and non-deterministic software anomalies. The system state …
High throughput data-compression for cloud storage
B Nicolae - International Conference on Data Management in Grid …, 2010 - Springer
As data volumes processed by large-scale distributed data-intensive applications grow at
high-speed, an increasing I/O pressure is put on the underlying storage service, which is …
high-speed, an increasing I/O pressure is put on the underlying storage service, which is …
Going back and forth: Efficient multideployment and multisnapshotting on clouds
Infrastructure as a Service (IaaS) cloud computing has revolutionized the way we think of
acquiring resources by introducing a simple change: allowing users to lease computational …
acquiring resources by introducing a simple change: allowing users to lease computational …
ScaDiPaSi: an effective scalable and distributable MapReduce-based method to find patient similarity on huge healthcare networks
M Barkhordari, M Niamanesh - Big Data Research, 2015 - Elsevier
Healthcare network information growth follows an exponential pattern, and current database
management systems cannot adequately manage this huge amount of data. It is necessary …
management systems cannot adequately manage this huge amount of data. It is necessary …
Skyline recomputation in big data
Retrieving relevant information in Big Data is a difficult task. Many approaches are used to
select highly relevant information. One of them is the Skyline operator, which is used to …
select highly relevant information. One of them is the Skyline operator, which is used to …
Optimizing a file system for different types of applications in a compute cluster using dynamic block size granularity
R Ananthanarayanan, K Gupta, P Pandey… - US Patent …, 2015 - Google Patents
Embodiments of the invention relate to optimizing a file system for different types of
applications in a compute cluster using dynamic block size granularity. An exemplary …
applications in a compute cluster using dynamic block size granularity. An exemplary …
Modeling of distributed file systems for practical performance analysis
Y Wu, F Ye, K Chen, W Zheng - IEEE Transactions on parallel …, 2013 - ieeexplore.ieee.org
Cloud computing has received significant attention recently. Delivering quality guaranteed
services in clouds is highly desired. Distributed file systems (DFSs) are the key component …
services in clouds is highly desired. Distributed file systems (DFSs) are the key component …