State-of-the-art of artificial intelligence and big data analytics reviews in five different domains: a bibliometric summary

PV Thayyib, R Mamilla, M Khan, H Fatima, M Asim… - Sustainability, 2023 - mdpi.com
Academicians and practitioners have recently begun to accord Artificial Intelligence (AI) and
Big Data Analytics (BDA) significant consideration when exploring emerging research trends …

Improving MapReduce performance using smart speculative execution strategy

Q Chen, C Liu, Z **ao - IEEE Transactions on Computers, 2013 - ieeexplore.ieee.org
MapReduce is a widely used parallel computing framework for large scale data processing.
The two major performance metrics in MapReduce are job execution time and cluster …

BlobSeer: Next-generation data management for large scale infrastructures

B Nicolae, G Antoniu, L Bougé, D Moise… - Journal of Parallel and …, 2011 - Elsevier
As data volumes increase at a high speed in more and more application fields of science,
engineering, information services, etc., the challenges posed by data-intensive computing …

Predicting software anomalies using machine learning techniques

J Alonso, L Belanche… - 2011 IEEE 10th …, 2011 - ieeexplore.ieee.org
In this paper, we present a detailed evaluation of a set of well-known Machine Learning
classifiers in front of dynamic and non-deterministic software anomalies. The system state …

High throughput data-compression for cloud storage

B Nicolae - International Conference on Data Management in Grid …, 2010 - Springer
As data volumes processed by large-scale distributed data-intensive applications grow at
high-speed, an increasing I/O pressure is put on the underlying storage service, which is …

Going back and forth: Efficient multideployment and multisnapshotting on clouds

B Nicolae, J Bresnahan, K Keahey… - Proceedings of the 20th …, 2011 - dl.acm.org
Infrastructure as a Service (IaaS) cloud computing has revolutionized the way we think of
acquiring resources by introducing a simple change: allowing users to lease computational …

ScaDiPaSi: an effective scalable and distributable MapReduce-based method to find patient similarity on huge healthcare networks

M Barkhordari, M Niamanesh - Big Data Research, 2015 - Elsevier
Healthcare network information growth follows an exponential pattern, and current database
management systems cannot adequately manage this huge amount of data. It is necessary …

Skyline recomputation in big data

C Bourahla, R Maamri, S Brahimi - Information Systems, 2023 - Elsevier
Retrieving relevant information in Big Data is a difficult task. Many approaches are used to
select highly relevant information. One of them is the Skyline operator, which is used to …

Optimizing a file system for different types of applications in a compute cluster using dynamic block size granularity

R Ananthanarayanan, K Gupta, P Pandey… - US Patent …, 2015 - Google Patents
Embodiments of the invention relate to optimizing a file system for different types of
applications in a compute cluster using dynamic block size granularity. An exemplary …

Modeling of distributed file systems for practical performance analysis

Y Wu, F Ye, K Chen, W Zheng - IEEE Transactions on parallel …, 2013 - ieeexplore.ieee.org
Cloud computing has received significant attention recently. Delivering quality guaranteed
services in clouds is highly desired. Distributed file systems (DFSs) are the key component …