Subspace clustering for high dimensional data: a review
Subspace clustering is an extension of traditional clustering that seeks to find clusters in
different subspaces within a dataset. Often in high dimensional data, many dimensions are …
different subspaces within a dataset. Often in high dimensional data, many dimensions are …
Overview and comparative study of dimensionality reduction techniques for high dimensional data
The recent developments in the modern data collection tools, techniques, and storage
capabilities are leading towards huge volume of data. The dimensions of data indicate the …
capabilities are leading towards huge volume of data. The dimensions of data indicate the …
Very sparse random projections
There has been considerable interest in random projections, an approximate algorithm for
estimating distances between pairs of points in a high-dimensional vector space. Let A in Rn …
estimating distances between pairs of points in a high-dimensional vector space. Let A in Rn …
To petabytes and beyond: recent advances in probabilistic and signal processing algorithms and their application to metagenomics
As computational biologists continue to be inundated by ever increasing amounts of
metagenomic data, the need for data analysis approaches that keep up with the pace of …
metagenomic data, the need for data analysis approaches that keep up with the pace of …
A novel accelerometer-based gesture recognition system
In this paper, we address the problem of gesture recognition using the theory of random
projection (RP) and by formulating the whole recognition problem as an 1-minimization …
projection (RP) and by formulating the whole recognition problem as an 1-minimization …
An empirical study of required dimensionality for large-scale latent semantic indexing applications
RB Bradford - Proceedings of the 17th ACM conference on …, 2008 - dl.acm.org
The technique of latent semantic indexing is used in a wide variety of commercial
applications. In these applications, the processing time and RAM required for SVD …
applications. In these applications, the processing time and RAM required for SVD …
Feature extraction methods in quantitative structure–activity relationship modeling: A comparative study
Computational approaches for synthesizing new chemical compounds have resulted in a
major explosion of chemical data in the field of drug discovery. The quantitative structure …
major explosion of chemical data in the field of drug discovery. The quantitative structure …
A framework for semantic web services discovery
This paper describes a framework for ontology-based flexible discovery of Semantic Web
services. The proposed approach relies on user-supplied, context-specific map**s from …
services. The proposed approach relies on user-supplied, context-specific map**s from …
[PDF][PDF] Comparing and combining dimension reduction techniques for efficient text clustering
B Tang, M Shepherd, E Milios… - Proceeding of SIAM …, 2005 - researchgate.net
A great challenge of text mining arises from the increasingly large text datasets and the high
dimensionality associated with natural language. In this research, a systematic study is …
dimensionality associated with natural language. In this research, a systematic study is …
Enhanced vector space models for content-based recommender systems
C Musto - Proceedings of the fourth ACM conference on …, 2010 - dl.acm.org
The use of Vector Space Models (VSM) in the area of Information Retrieval is an established
practice within the scientific community. The reason is twofold: first, its very clean and solid …
practice within the scientific community. The reason is twofold: first, its very clean and solid …