Anomaly detection: A survey

V Chandola, A Banerjee, V Kumar - ACM computing surveys (CSUR), 2009 - dl.acm.org

Anomaly detection is an important problem that has been researched within diverse
research areas and application domains. Many anomaly detection techniques have been …

Cited by 15746

Subspace clustering for high dimensional data: a review

L Parsons, E Haque, H Liu - ACM sigkdd explorations newsletter, 2004 - dl.acm.org

Subspace clustering is an extension of traditional clustering that seeks to find clusters in
different subspaces within a dataset. Often in high dimensional data, many dimensions are …

Cited by 1974

Research on K-value selection method of K-means clustering algorithm

C Yuan, H Yang - J, 2019 - mdpi.com

Among many clustering algorithms, the K-means clustering algorithm is widely used
because of its simple algorithm and fast convergence. However, the K-value of clustering …

Cited by 984

BIRCH: an efficient data clustering method for very large databases

T Zhang, R Ramakrishnan, M Livny - ACM sigmod record, 1996 - dl.acm.org

Finding useful patterns in large datasets has attracted considerable interest recently, and
one of the most widely studied problems in this area is the identification of clusters, or …

Cited by 7973

A survey of clustering data mining techniques

P Berkhin - Grouping multidimensional data: Recent advances in …, 2006 - Springer

Clustering is the division of data into groups of similar objects. In clustering, some details are
disregarded in exchange for data simplification. Clustering can be viewed as a data …

Cited by 5029

[BOOK] Data clustering: theory, algorithms, and applications

G Gan, C Ma, J Wu - 2020 - SIAM

The monograph Data Clustering: Theory, Algorithms, and Applications was published in
2007. Starting with the common ground and knowledge for data clustering, the monograph …

Cited by 2690

[BOOK] The data matching process

P Christen - 2012 - Springer

This chapter provides an overview of the data matching process, and describes the five
major steps involved in this process: data pre-processing (cleaning and standardisation) …

Cited by 1665

Clustering huge protein sequence sets in linear time

M Steinegger, J Söding - Nature communications, 2018 - nature.com

Metagenomic datasets contain billions of protein sequences that could greatly enhance
large-scale functional annotation and structure prediction. Utilizing this enormous resource …

Cited by 770

Duplicate record detection: A survey

AK Elmagarmid, PG Ipeirotis… - IEEE Transactions on …, 2006 - ieeexplore.ieee.org

Often, in the real world, entities have two or more representations in databases. Duplicate
records do not share a common key and/or they contain errors that make duplicate matching …

Cited by 2815

A survey of techniques for event detection in twitter

F Atefeh, W Khreich - Computational Intelligence, 2015 - Wiley Online Library

Twitter is among the fastest‐growing microblogging and online social networking services.
Messages posted on Twitter (tweets) have been reporting everything from daily life stories to …

Cited by 985

Efficient clustering of high-dimensional data sets with application to reference matching

Anomaly detection: A survey
