From data mining to wisdom mining
The knowledge gained from data mining is highly dependent on the experience of an expert
for further analysis to increase effectiveness and wise decision-making. This mined …
for further analysis to increase effectiveness and wise decision-making. This mined …
[PDF][PDF] A state-of-the-art survey on semantic similarity for document clustering using GloVe and density-based algorithms
SM Mohammed, K Jacksi… - Indonesian Journal of …, 2021 - pdfs.semanticscholar.org
Semantic similarity is the process of identifying relevant data semantically. The traditional
way of identifying document similarity is by using synonymous keywords and syntactician. In …
way of identifying document similarity is by using synonymous keywords and syntactician. In …
A review of approaches for the detection and treatment of outliers in processing wind turbine and wind farm measurements
M Zou, SZ Djokic - Energies, 2020 - mdpi.com
Due to the significant increase of the number of wind-based electricity generation systems, it
is important to have accurate information on their operational characteristics, which are …
is important to have accurate information on their operational characteristics, which are …
K-means clustering with incomplete data
Clustering has been intensively studied in machine learning and data mining communities.
Although demonstrating promising performance in various applications, most of the existing …
Although demonstrating promising performance in various applications, most of the existing …
Density peaks clustering based on weighted local density sequence and nearest neighbor assignment
Density peaks clustering (DPC) is a density-based clustering algorithm with excellent
clustering performance including accuracy, automatically detecting the number of clusters …
clustering performance including accuracy, automatically detecting the number of clusters …
CHIEF: Clustering with higher-order motifs in big networks
Clustering network vertices is an enabler of various applications such as social computing
and Internet of Things. However, challenges arise for clustering when networks increase in …
and Internet of Things. However, challenges arise for clustering when networks increase in …
KNN-SC: novel spectral clustering algorithm using k-nearest neighbors
Spectral clustering is a well-known graph-theoretic clustering algorithm. Although spectral
clustering has several desirable advantages (such as the capability of discovering non …
clustering has several desirable advantages (such as the capability of discovering non …
Stacked denoising autoencoder with density-grid based clustering method for detecting outlier of wind turbine components
Z Sun, H Sun - IEEE Access, 2019 - ieeexplore.ieee.org
Different types of outliers have existed in the monitoring data of wind turbines, which are not
conducive to the follow-up data mining. However, the complex inner characteristics of the …
conducive to the follow-up data mining. However, the complex inner characteristics of the …
An internal validity index based on density-involved distance
It is crucial to evaluate the quality of clustering results in cluster analysis. Although many
cluster validity indices (CVIs) have been proposed in the literature, they have some …
cluster validity indices (CVIs) have been proposed in the literature, they have some …
Research on detection methods based on Doc2vec abnormal comments
W Chang, Z Xu, S Zhou, W Cao - Future Generation Computer Systems, 2018 - Elsevier
The purpose of this paper is to explore a method for detecting abnormal comments. With the
growth of e-commerce sites and reviews sites, user reviews of messages begin to affect …
growth of e-commerce sites and reviews sites, user reviews of messages begin to affect …