[HTML][HTML] Partition based clustering of large datasets using MapReduce framework: An analysis of recent themes and directions

TH Sardar, Z Ansari - Future Computing and Informatics Journal, 2018 - Elsevier
Data clustering is one of the fundamental techniques in scientific analysis and data mining,
which describes a dataset according to similarities among its objects. Partition based …

ARIS: a noise insensitive data pre-processing scheme for data reduction using influence space

J Cai, Y Yang, H Yang, X Zhao, J Hao - ACM Transactions on …, 2022 - dl.acm.org
The extensive growth of data quantity has posed many challenges to data analysis and
retrieval. Noise and redundancy are typical representatives of the above-mentioned …

Soft and declarative fishing of information in big data lake

B Małysiak-Mrozek, M Stabla… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
In recent years, many fields that experience a sudden proliferation of data, which increases
the volume of data that must be processed and the variety of formats the data is stored in …

High-efficient fuzzy querying with hiveql for big data warehousing

B Małysiak-Mrozek, J Wieszok… - … on Fuzzy Systems, 2021 - ieeexplore.ieee.org
Querying and reporting from large volumes of structured, semistructured, and unstructured
data often requires some flexibility. This flexibility provided by fuzzy sets allows for …

A hop** umbrella for fuzzy joining data streams from IoT devices in the cloud and on the edge

D Mrozek, K Tokarz, D Pankowski… - … on Fuzzy Systems, 2019 - ieeexplore.ieee.org
Internet of Things (IoT) is a new technology that changes the image of the current world,
yielding new possibilities, but also proliferating data. IoT devices may constantly produce …

[PDF][PDF] Enhancing big data value using knowledge discovery techniques

M Abdrabo, M Elmogy, G Eltaweel… - IJ Information Technology …, 2016 - academia.edu
The world has been drowned by floods of data due to technological development.
Consequently, the Big Data term has gotten the expression to portray the gigantic sum …

[HTML][HTML] Stage–Specific predictive models for main prognosis measures of breast cancer

AA Said, LA Abd-Elmegid, S Kholeif… - Future Computing and …, 2018 - Elsevier
Breast cancer is a malignant tumor that starts in the cells of the breast. A malignant tumor is
a group of cancer cells that can grow into near tissues or invading the distant areas of the …

[PDF][PDF] Clustering algorithms and their applications in cloud computing environment

US Patki - International Research Journal of Computer Science, 2017 - academia.edu
Cloud computing is Internet-based computing that provides shared computer processing
resources and data to computers and other devices on demand. Cloud computing is the …

Big data analytics using soft computing techniques: A study

DK Sreekantha - Communication and Computing Systems, 2019 - taylorfrancis.com
Big data comprises large volume data having structure, partially structure and no structure
gathered through scientific, social, business and industrial operations continuously. This big …

A Survey on Big Data Analytics Using HADOOP

S Mamatha, T Sudha - Asian Journal of Computer Science and Technology, 2019 - ajcst.co
In this digital world, as organizations are evolving rapidly with data centric asset the
explosion of data and size of the databases have been growing exponentially. Data is …