Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Data cleaning and machine learning: a systematic literature review
Abstract Machine Learning (ML) is integrated into a growing number of systems for various
applications. Because the performance of an ML model is highly dependent on the quality of …
applications. Because the performance of an ML model is highly dependent on the quality of …
The battleship approach to the low resource entity matching problem
Entity matching, a core data integration problem, is the task of deciding whether two data
tuples refer to the same real-world entity. Recent advances in deep learning methods, using …
tuples refer to the same real-world entity. Recent advances in deep learning methods, using …
Active deep learning on entity resolution by risk sampling
Y Nafa, Q Chen, Z Chen, X Lu, H He, T Duan… - Knowledge-Based …, 2022 - Elsevier
While the state-of-the-art performance on entity resolution (ER) has been achieved by deep
learning, its effectiveness depends on large quantities of accurately labeled training data. To …
learning, its effectiveness depends on large quantities of accurately labeled training data. To …
Deep clustering for data cleaning and integration
Deep Learning (DL) techniques now constitute the state-of-the-art for important problems in
areas such as text and image processing, and there have been impactful results that deploy …
areas such as text and image processing, and there have been impactful results that deploy …
Transformer-based denoising adversarial variational entity resolution
S Li, H Wu - Journal of Intelligent Information Systems, 2023 - Springer
Entity resolution (ER), precisely identifying different representations of the same real-world
entities, is critical for data integration. The ER question has been studied for many years …
entities, is critical for data integration. The ER question has been studied for many years …
Low-resource entity resolution with domain generalization and active learning
Z Xu, N Wang - Neurocomputing, 2024 - Elsevier
Entity Resolution (ER), a fundamental task in data cleaning and integration, is critical in
various fields such as healthcare, e-commerce, and social networks. Traditional ER methods …
various fields such as healthcare, e-commerce, and social networks. Traditional ER methods …
MixER: linear interpolation of latent space for entity resolution
H Wu, S Li - Complex & Intelligent Systems, 2024 - Springer
Entity resolution, accurately identifying various representations of the same real-world
entities, is a crucial part of data integration systems. While existing learning-based models …
entities, is a crucial part of data integration systems. While existing learning-based models …
A Framework to Evaluate the Quality of Integrated Datasets
Evaluation is a bottleneck in data integration processes: it is performed by domain experts
through manual onerous data inspections. This task is particularly heavy in real business …
through manual onerous data inspections. This task is particularly heavy in real business …
SAREM: semi-supervised active heterogeneous entity matching framework
Entity matching is a key technique in data quality research, which refers to the identification
of records that refer to the same real-world entity in different data sources. This paper …
of records that refer to the same real-world entity in different data sources. This paper …
Dual-Module Feature Alignment Domain Adversarial Model for Entity Resolution
H Song, M Liu, S Zhang, Q Han - 2024 11th International …, 2024 - ieeexplore.ieee.org
Entity Resolution (ER) is a fundamental task in data integration, aiming to identify data
objects across different sources that refer to the same real-world entity. In recent years, deep …
objects across different sources that refer to the same real-world entity. In recent years, deep …