Recent advances and emerging challenges of feature selection in the context of big data
In an era of growing data complexity and volume and the advent of big data, feature
selection has a key role to play in hel** reduce high-dimensionality in machine learning …
selection has a key role to play in hel** reduce high-dimensionality in machine learning …
Feature selection for clustering: A review
Dimensionality reduction techniques can be categorized mainly into feature extraction and
feature selection. In the feature extraction approach, features are projected into a new space …
feature selection. In the feature extraction approach, features are projected into a new space …
Whale optimization approaches for wrapper feature selection
Classification accuracy highly dependents on the nature of the features in a dataset which
may contain irrelevant or redundant data. The main aim of feature selection is to eliminate …
may contain irrelevant or redundant data. The main aim of feature selection is to eliminate …
Hybrid whale optimization algorithm with simulated annealing for feature selection
Hybrid metaheuristics are of the most interesting recent trends in optimization and memetic
algorithms. In this paper, two hybridization models are used to design different feature …
algorithms. In this paper, two hybridization models are used to design different feature …
Explainable k-means and k-medians clustering
Many clustering algorithms lead to cluster assignments that are hard to explain, partially
because they depend on all the features of the data in a complicated way. To improve …
because they depend on all the features of the data in a complicated way. To improve …
Randomized algorithms for matrices and data
MW Mahoney - Foundations and Trends® in Machine …, 2011 - nowpublishers.com
Randomized algorithms for very large matrix problems have received a great deal of
attention in recent years. Much of this work was motivated by problems in large-scale data …
attention in recent years. Much of this work was motivated by problems in large-scale data …
Turning Big Data Into Tiny Data: Constant-Size Coresets for -Means, PCA, and Projective Clustering
We develop and analyze a method to reduce the size of a very large set of data points in a
high-dimensional Euclidean space R^d to a small set of weighted points such that the result …
high-dimensional Euclidean space R^d to a small set of weighted points such that the result …
Dimensionality reduction for k-means clustering and low rank approximation
We show how to approximate a data matrix A with a much smaller sketch~ A that can be
used to solve a general class of constrained k-rank approximation problems to within (1+ ε) …
used to solve a general class of constrained k-rank approximation problems to within (1+ ε) …
An efficient approximation to the K-means clustering for massive data
Due to the progressive growth of the amount of data available in a wide variety of scientific
fields, it has become more difficult to manipulate and analyze such information. In spite of its …
fields, it has become more difficult to manipulate and analyze such information. In spite of its …