Big data analytics on Apache Spark

S Salloum, R Dautov, X Chen, PX Peng… - International Journal of …, 2016 - Springer
Apache Spark has emerged as the de facto framework for big data analytics with its
advanced in-memory programming model and upper-level libraries for scalable machine …

The big data system, components, tools, and technologies: a survey

TR Rao, P Mitra, R Bhatt, A Goswami - Knowledge and Information …, 2019 - Springer
The traditional databases are not capable of handling unstructured data and high volumes
of real-time datasets. Diverse datasets are unstructured lead to big data, and it is laborious …

Fuzzy hypergraph network for recommending top-K profitable stocks

X Ma, T Zhao, Q Guo, X Li, C Zhang - Information Sciences, 2022 - Elsevier
Stock ranking prediction is an effective method for screening high investment value stocks in
the future and can strongly assist investors in making decisions. However, this task is also …

Memtune: Dynamic memory management for in-memory data analytic platforms

L Xu, M Li, L Zhang, AR Butt, Y Wang… - 2016 IEEE international …, 2016 - ieeexplore.ieee.org
Memory is a crucial resource for big data processing frameworks such as Spark and M3R,
where the memory is used both for computation and for caching intermediate storage data …

Management and analysis of big graph data: current systems and open challenges

M Junghanns, A Petermann, M Neumann… - Handbook of big data …, 2017 - Springer
Many big data applications in business and science require the management and analysis
of huge amounts of graph data. Suitable systems to manage and to analyze such graph data …

VENUS: Vertex-centric streamlined graph computation on a single PC

J Cheng, Q Liu, Z Li, W Fan, JCS Lui… - 2015 IEEE 31st …, 2015 - ieeexplore.ieee.org
Recent studies show that disk-based graph computation on just a single PC can be as
highly competitive as cluster-based computing systems on large-scale problems. Inspired by …

A comprehensive survey on cloud data mining (CDM) frameworks and algorithms

HB Barua, KC Mondal - ACM Computing Surveys (CSUR), 2019 - dl.acm.org
Data mining is used for finding meaningful information out of a vast expanse of data. With
the advent of Big Data concept, data mining has come to much more prominence …

Understanding the behavior of in-memory computing workloads

T Jiang, Q Zhang, R Hou, L Chai… - 2014 IEEE …, 2014 - ieeexplore.ieee.org
The increasing demands of big data applications have led researchers and practitioners to
turn to in-memory computing to speed processing. For instance, the Apache Spark …

Systems for big-graphs

A Khan, S Elnikety - Proceedings of the VLDB Endowment, 2014 - dl.acm.org
Graphs have become increasingly important to represent highly-interconnected structures
and schema-less data including the World Wide Web, social networks, knowledge graphs …

Interactive big data management in healthcare using spark

J Archenaa, EAM Anita - Proceedings of the 3rd International Symposium …, 2016 - Springer
This paper gives an insight on how to use apache spark for performing predictive analytics
using the healthcare data. Large amount of data such as Physician notes, medical history …