Parallel and scalable Dunn Index for the validation of big data clusters

CEB Ncir, A Hamza, W Bouaguel - Parallel Computing, 2021 - Elsevier
Parallelizing data clustering algorithms has attracted the interest of many researchers over
the past few years. Many efficient parallel algorithms were proposed to build partitioning …

Parallel and distributed successive convex approximation methods for big-data optimization

A Nedić, JS Pang, G Scutari, Y Sun, G Scutari… - Multi-Agent Optimization …, 2018 - Springer
Recent years have witnessed a surge of interest in parallel and distributed optimization
methods for large-scale systems. In particular, nonconvex large-scale optimization problems …

Preconditioned data sparsification for big data with applications to PCA and K-means

F Pourkamali-Anaraki, S Becker - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
We analyze a compression scheme for large data sets that randomly keeps a small
percentage of the components of each data sample. The benefit is that the output is a sparse …

Big Data in civil engineering: a state-of-the-art survey

O Kapliński, N Košeleva, G Ropaitė - Engineering Structures and …, 2016 - Taylor & Francis
Data generation has increased drastically over the past few years. Data management has
also grown in importance because extracting the significant value out of a huge pile of raw …

Sketched subspace clustering

PA Traganitis, GB Giannakis - IEEE Transactions on Signal …, 2017 - ieeexplore.ieee.org
The immense amount of daily generated and communicated data presents unique
challenges in their processing. Clustering, the grou** of data without the presence of …

A hybrid MapReduce-based k-means clustering using genetic algorithm for distributed datasets

A Sinha, PK Jana - The Journal of Supercomputing, 2018 - Springer
Clustering a large volume of data in a distributed environment is a challenging issue. Data
stored across multiple machines are huge in size, and solution space is large. Genetic …

Combining density peaks clustering and gravitational search method to enhance data clustering

L Sun, T Tao, X Zheng, S Bao, Y Luo - Engineering Applications of Artificial …, 2019 - Elsevier
Data clustering is a valuable field for extracting effective information and hidden patterns
from datasets. In this paper we propose a clustering approach based on density peaks …

Randomized clustered nystrom for large-scale kernel machines

F Pourkamali-Anaraki, S Becker, M Wakin - Proceedings of the AAAI …, 2018 - ojs.aaai.org
The Nystrom method is a popular technique for generating low-rank approximations of
kernel matrices that arise in many machine learning problems. The approximation quality of …

MapReduce framework based big data clustering using fractional integrated sparse fuzzy C means algorithm

O Kulkarni, S Jena, V Ravi Sankar - IET Image Processing, 2020 - Wiley Online Library
Big data analytics gain significant interest over the traditional data‐processing
methodologies that engage in extracting the hidden patterns and correlations from the …

A secure technique for unstructured big data using clustering method

MT Nafis, R Biswas - International Journal of Information Technology, 2022 - Springer
Analyzing the multi-dimensional data in faster way is an important and basic aspect in any
clustering mechanism. At the same time the clustering mechanism should provide a security …