Is it time to stop swee** data cleaning under the carpet? A novel algorithm for outlier management in growth data

CSC Woolley, IG Handel, BM Bronsvoort… - PloS one, 2020 - journals.plos.org
All data are prone to error and require data cleaning prior to analysis. An important example
is longitudinal growth data, for which there are no universally agreed standard methods for …

[HTML][HTML] Methodology for fuzzy duplicate record identification based on the semantic-syntactic information of similarity

D Hadzic, N Sarajlic - Journal of King Saud University-Computer and …, 2020 - Elsevier
There are different methodologies for identification of fuzzy duplicate records in the process
of data cleaning for data warehouse and data mining. The methodologies for duplicate …

An Interactive Visual Analysis Method for Multi-Dimensional Data Deduplication

H Zhu, Z Qian, F Yan, K Mao, H Ying, J Wang… - Journal of Computer …, 2022 - jcad.cn
Duplication in multi-dimensional data seriously interferes with data mining, analysis, and
application. Traditional data deduplication methods cannot meet the requirements for …

Identificação e Caracterização de Reclamações Duplicadas por Consumidores em Múltiplas Plataformas

G Rabbi, MMR Araújo, G Kakizaki, J Viterbo… - Simpósio Brasileiro de …, 2024 - sol.sbc.org.br
O crescente volume de dados em repositórios de reclamações de consumidores impõe
desafios significativos para a gestão eficaz dessas informações. Dentre estes desafios …

[PDF][PDF] Methods and techniques to evaluate the performance of Data Cleansing Algorithms for very Large Database Systems

RDDRM Chezian - International Journal of Advanced Research in …, 2016 - ngmcollege.in
The data cleansing algorithm has a key role in this competitive environment as for decision
making considered the system requires more precise information.. Yet the inconsistency in …

Abnormal Analysis and Treatment of Voltage Test Data Based on Deep Learning

HW Xu, Y Wang, XL Yang, PF Yang - Proceedings of the 2023 …, 2023 - dl.acm.org
How to find the abnormal points in data effectively and quickly and give a reasonable
explanation is the main content of anomaly detection. The development of deep learning …

支持多维度数据去重的交互式可视分析方法

朱海洋, 钱中昊, 严凡, 毛科添, 应昊键, 王杰… - 计算机辅助设计与图形学 …, 2022 - jcad.cn
多维度数据中的重复数据会严重影响数据的挖掘, 分析与应用. 针对传统的数据去重方法的成本,
效率和便捷性无法满足大数据分析需求的问题, 提出一种数据去重的交互式可视分析方法 …

TLBO-Based Optimal Speed Controller Design for Induction Motors Using Fuzzy Sliding Mode Controller

A Alfi - مجله علمی رایانش نرم و فناوری اطلاعات, 2017‎ - jscit.nit.ac.ir
Mobile Ad Hoc Network (MANET) is a special and attractive type of new wireless networks. It
is an autonomous system that can dynamically be set up anywhere and anytime without …

Using fuzzy logic technique to eliminate the duplicates in large database

MM Hamad, AA Jihad - Journal of University of Human …, 2015 - journals.uhd.edu.iq
Duplicate records are broad problem in many of the databases. There are wide efforts
focusing on elimination of duplicate in data sets, because is it important part of data …

De-duplication framework to reduce the record linkage problem

A Dagade, M Mali - 2017 International Conference on …, 2017 - ieeexplore.ieee.org
User profiling plays a very crucial role in government organizations. Fraudulent cases occur
due to multiple representations of user profile for a single user. Record linkage or record …