High performance Hadoop distributed file system

M Elkawkagy, H Elbeh - … Journal of Networked and Distributed Computing, 2020 - Springer
Although by the end of 2020 most companies will be running 1000-node Hadoop in the
system, the Hadoop implementation is still accompanied by many challenges like security …

An improved technique for increasing availability in Big Data replication

MR Kaseb, MH Khafagy, IA Ali, ESM Saad - Future generation computer …, 2019 - Elsevier
Big Data represents a major challenge for the performance of the cloud computing storage
systems. Some distributed file systems (DFS) are widely used to store big data, such as …

Density estimation-based method to determine sample size for random sample partition of big data

Y He, J Chen, J Shen, P Fournier-Viger… - Frontiers of Computer …, 2024 - Springer
Random sample partition (RSP) is a newly developed big data representation and
management model to deal with big data approximate computation problems. Academic …

Analysis of blockchain and interplanetary file system (IPFS) utilization for big data architecture optimization

AP Ahmad, AA Ilham… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Big data is the collection of very complex data sets that are very difficult to process by
traditional data processing applications. This data comes from various devices or media that …

The Design and Implementation of Campus Network Streaming Media Live Video On-Demand System Based on Nginx and FFmpeg

B She, Q Wang, X Zhong, Z Zhang… - Journal of Physics …, 2020 - iopscience.iop.org
For the current demand for live broadcast and live video in colleges and universities, relying
on the advantages of campus network, this paper designed and implemented a video live …

Enhancements and Intelligent Approach to Optimize Big data Storage and Management: Random Enhanced HDFS (REHDFS) and DNA Storage

MSNRJ Abouchabaka - … on Technical and Physical Problems of …, 2022 - iotpe.tabaelm.com
The evolution of mobile technology, the popularization of tablets and smartphones, the daily
data generated by industries, large organizations, and research institutes, and the rapid …

An improved data placement strategy in a heterogeneous Hadoop cluster

W Zhao, L Meng, J Sun, Y Ding… - Open Cybernetics and …, 2015 - benthamopen.com
Hadoop Distributed File System (HDFS) is designed to store big data reliably,
and to stream these data at high bandwidth to user applications. However, the default HDFS …

A network load sensitive block placement strategy of HDFS

L Meng, W Zhao, H Zhao, Y Ding - KSII Transactions on Internet …, 2015 - koreascience.kr
This paper investigates and analyzes the default block placement strategy of HDFS. HDFS is
a typical representative distributed file system to stream vast amount of data effectively at …

Redundant independent files (RIF): a technique for reducing storage and resources in big data replication

MR Kaseb, MH Khafagy, IA Ali, ESM Saad - Trends and Advances in …, 2018 - Springer
Most cloud computing storage systems widely use a distributed file system (DFS) to store
big data, such as Hadoop Distributed File System (HDFS) and Google File System (GFS) …

A comparative study of HDFS replication approaches

ES Abead, MH Khafagy… - International Journal in IT & …, 2015 - indianjournals.com
The Hadoop Distributed File System (HDFS) is designed to store, analyze, and transfer large-scale
data sets, and to stream them at high bandwidth to user applications. It handles fault …