Parallel and scalable Dunn Index for the validation of big data clusters
Parallelizing data clustering algorithms has attracted the interest of many researchers over
the past few years. Many efficient parallel algorithms were proposed to build partitioning …
the past few years. Many efficient parallel algorithms were proposed to build partitioning …
Parallel and distributed successive convex approximation methods for big-data optimization
Recent years have witnessed a surge of interest in parallel and distributed optimization
methods for large-scale systems. In particular, nonconvex large-scale optimization problems …
methods for large-scale systems. In particular, nonconvex large-scale optimization problems …
Preconditioned data sparsification for big data with applications to PCA and K-means
We analyze a compression scheme for large data sets that randomly keeps a small
percentage of the components of each data sample. The benefit is that the output is a sparse …
percentage of the components of each data sample. The benefit is that the output is a sparse …
Big Data in civil engineering: a state-of-the-art survey
O Kapliński, N Košeleva, G Ropaitė - Engineering Structures and …, 2016 - Taylor & Francis
Data generation has increased drastically over the past few years. Data management has
also grown in importance because extracting the significant value out of a huge pile of raw …
also grown in importance because extracting the significant value out of a huge pile of raw …
Sketched subspace clustering
The immense amount of daily generated and communicated data presents unique
challenges in their processing. Clustering, the grou** of data without the presence of …
challenges in their processing. Clustering, the grou** of data without the presence of …
A hybrid MapReduce-based k-means clustering using genetic algorithm for distributed datasets
Clustering a large volume of data in a distributed environment is a challenging issue. Data
stored across multiple machines are huge in size, and solution space is large. Genetic …
stored across multiple machines are huge in size, and solution space is large. Genetic …
Combining density peaks clustering and gravitational search method to enhance data clustering
Data clustering is a valuable field for extracting effective information and hidden patterns
from datasets. In this paper we propose a clustering approach based on density peaks …
from datasets. In this paper we propose a clustering approach based on density peaks …
Randomized clustered nystrom for large-scale kernel machines
The Nystrom method is a popular technique for generating low-rank approximations of
kernel matrices that arise in many machine learning problems. The approximation quality of …
kernel matrices that arise in many machine learning problems. The approximation quality of …
MapReduce framework based big data clustering using fractional integrated sparse fuzzy C means algorithm
Big data analytics gain significant interest over the traditional data‐processing
methodologies that engage in extracting the hidden patterns and correlations from the …
methodologies that engage in extracting the hidden patterns and correlations from the …
A secure technique for unstructured big data using clustering method
MT Nafis, R Biswas - International Journal of Information Technology, 2022 - Springer
Analyzing the multi-dimensional data in faster way is an important and basic aspect in any
clustering mechanism. At the same time the clustering mechanism should provide a security …
clustering mechanism. At the same time the clustering mechanism should provide a security …