[PDF][PDF] Predicting potential banking customer churn using apache spark ML and MLlib packages: a comparative study

H Sayed, MA Abdel-Fattah… - International Journal of …, 2018 - researchgate.net
This study was conducted based on an assumption that Spark ML package has much better
performance and accuracy than Spark MLlib package in dealing with big data. The used …

A study of big data analytics using apache spark with Python and Scala

YK Gupta, S Kumari - 2020 3rd International Conference on …, 2020 - ieeexplore.ieee.org
Data is generated by humans every day via various sources such as Instagram, Facebook,
Twitter, Google, etc at a rate of 2.5 quintillion bytes with high volume, high speed and high …

SMBSP: a self-tuning approach using machine learning to improve performance of spark in big data processing

MA Rahman, J Hossen… - 2018 7th International …, 2018 - ieeexplore.ieee.org
Apache Spark, popularly known for big data processing capability, is a distributed open-
source platform that uses the concept of distributed memory to facilitate big data processing …

Dynamic Distributed and Parallel Machine Learning algorithms for big data mining processing

L Djafri - Data Technologies and Applications, 2022 - emerald.com
Purpose This work can be used as a building block in other settings such as GPU, Map-
Reduce, Spark or any other. Also, DDPML can be deployed on other distributed systems …

Real-time web-based International Flight Tickets Recommendation System via Apache Spark

M Malkawi, R Alhajj - … on Information Reuse and Integration for …, 2023 - ieeexplore.ieee.org
Traveling by airplane has become more popular with advanced technology. The tickets can
be booked effortlessly via airlines corporation's online platforms. However, recommending …

Performance evaluation of Spark SQL for batch processing

K Anusha, K Usha Rani - … Research in Data Engineering Systems and …, 2020 - Springer
Now-a-days, large amount of data is being generated at various organizations. In many
organizations, there is an inefficiency of handling Big Data with higher volumes, velocity …

[PDF][PDF] A smart method for spark using neural network for big data

MA Rahman, J Hossen, A Sultana… - International Journal of …, 2021 - academia.edu
Apache spark, famously known for big data handling ability, is a distributed open-source
framework that utilizes the idea of distributed memory to process big data. As the …

A study of chronic diseases by using apache pig framework

S Choudhary, YK Gupta - AIP Conference Proceedings, 2023 - pubs.aip.org
Data is growing at a breakneck speed these days. Every day, new data is generated from a
variety of sources, including Facebook, Instagram, Twitter, e-commerce sites, and many …

Enhancement of video streaming analysis using cluster-computing framework

J Arthanari, R Baskaran - Cluster Computing, 2019 - Springer
Video content analysis is an emerging technique to easily redact video footage for public
disclosure and to identify events and objects in surveillance cameras. The proficiency of this …

DNA barcoding using particle swarm optimization on apache spark SQL case study: DNA of covid-19

LS Riza, MI Nurfathiya, J Kusnendar… - … Journal of Nonlinear …, 2021 - ijnaa.semnan.ac.ir
The objective of this research is to design and implement a computational model to
determine DNA barcodes by utilizing the Particle Swarm Optimization (PSO) algorithms …