Subspace clustering for high dimensional data: a review

L Parsons, E Haque, H Liu - Acm sigkdd explorations newsletter, 2004 - dl.acm.org
Subspace clustering is an extension of traditional clustering that seeks to find clusters in
different subspaces within a dataset. Often in high dimensional data, many dimensions are …

Overview and comparative study of dimensionality reduction techniques for high dimensional data

S Ayesha, MK Hanif, R Talib - Information Fusion, 2020 - Elsevier
The recent developments in the modern data collection tools, techniques, and storage
capabilities are leading towards huge volume of data. The dimensions of data indicate the …

Very sparse random projections

P Li, TJ Hastie, KW Church - Proceedings of the 12th ACM SIGKDD …, 2006 - dl.acm.org
There has been considerable interest in random projections, an approximate algorithm for
estimating distances between pairs of points in a high-dimensional vector space. Let A in Rn …

To petabytes and beyond: recent advances in probabilistic and signal processing algorithms and their application to metagenomics

RAL Elworth, Q Wang, PK Kota… - Nucleic acids …, 2020 - academic.oup.com
As computational biologists continue to be inundated by ever increasing amounts of
metagenomic data, the need for data analysis approaches that keep up with the pace of …

A novel accelerometer-based gesture recognition system

A Akl, C Feng, S Valaee - IEEE transactions on Signal …, 2011 - ieeexplore.ieee.org
In this paper, we address the problem of gesture recognition using the theory of random
projection (RP) and by formulating the whole recognition problem as an 1-minimization …

An empirical study of required dimensionality for large-scale latent semantic indexing applications

RB Bradford - Proceedings of the 17th ACM conference on …, 2008 - dl.acm.org
The technique of latent semantic indexing is used in a wide variety of commercial
applications. In these applications, the processing time and RAM required for SVD …

Feature extraction methods in quantitative structure–activity relationship modeling: A comparative study

SA Alsenan, IM Al-Turaiki, AM Hafez - Ieee Access, 2020 - ieeexplore.ieee.org
Computational approaches for synthesizing new chemical compounds have resulted in a
major explosion of chemical data in the field of drug discovery. The quantitative structure …

A framework for semantic web services discovery

J Pathak, N Koul, D Caragea, VG Honavar - Proceedings of the 7th …, 2005 - dl.acm.org
This paper describes a framework for ontology-based flexible discovery of Semantic Web
services. The proposed approach relies on user-supplied, context-specific map**s from …

[PDF][PDF] Comparing and combining dimension reduction techniques for efficient text clustering

B Tang, M Shepherd, E Milios… - Proceeding of SIAM …, 2005 - researchgate.net
A great challenge of text mining arises from the increasingly large text datasets and the high
dimensionality associated with natural language. In this research, a systematic study is …

Enhanced vector space models for content-based recommender systems

C Musto - Proceedings of the fourth ACM conference on …, 2010 - dl.acm.org
The use of Vector Space Models (VSM) in the area of Information Retrieval is an established
practice within the scientific community. The reason is twofold: first, its very clean and solid …