Large-scale deduplication with constraints using dedupalog
We present a declarative framework for collective deduplication of entity references in the
presence of constraints. Constraints occur naturally in many data cleaning domains and can …
presence of constraints. Constraints occur naturally in many data cleaning domains and can …
Constrained clustering: Current and new trends
Clustering is an unsupervised process which aims to discover regularities and underlying
structures in data. Constrained clustering extends clustering in such a way that expert …
structures in data. Constrained clustering extends clustering in such a way that expert …
Constrained distance based clustering for time-series: a comparative and experimental study
Constrained clustering is becoming an increasingly popular approach in data mining. It
offers a balance between the complexity of producing a formal definition of thematic classes …
offers a balance between the complexity of producing a formal definition of thematic classes …
Constrained locally weighted clustering
Data clustering is a difficult problem due to the complex and heterogeneous natures of
multidimensional data. To improve clustering accuracy, we propose a scheme to capture the …
multidimensional data. To improve clustering accuracy, we propose a scheme to capture the …
A novel semi-supervised approach for network traffic clustering
Network traffic classification is an essential component for network management and
security systems. To address the limitations of traditional port-based and payload-based …
security systems. To address the limitations of traditional port-based and payload-based …
Survey on using constraints in data mining
This paper provides an overview of the current state-of-the-art on using constraints in
knowledge discovery and data mining. The use of constraints in a data mining task requires …
knowledge discovery and data mining. The use of constraints in a data mining task requires …
Pairwise constraint propagation with dual adversarial manifold regularization
Pairwise constraints (PCs) composed of must-links (MLs) and cannot-links (CLs) are widely
used in many semisupervised tasks. Due to the limited number of PCs, pairwise constraint …
used in many semisupervised tasks. Due to the limited number of PCs, pairwise constraint …