[BOOK][B] The data matching process
P Christen, P Christen - 2012 - Springer
This chapter provides an overview of the data matching process, and describes the five
major steps involved in this process: data pre-processing (cleaning and standardisation) …
Data-Centric Systems and Applications
The rapid growth of the Web in the past two decades has made it the largest publicly
accessible data source in the world. Web mining aims to discover useful information or …
Fuzzy data operations
A Anderson - US Patent 8,484,215, 2013 - Google Patents
US PATENT DOCUMENTS 5,179,643 A 1/1993 Homma et al. 5,388,259 A 2/1995
Fleischman et al. 5,832,182 A 11/1998 Zhang et al. 6,026,398 A 2/2000 Brown et al …
Data clustering based on candidate queries
A Anderson, K Trojan - US Patent 9,361,355, 2016 - Google Patents
6,581,058 B1 6/2003 Fayyad et al. JP H09-044518 2/1997 6,658,626 B1 12/2003 Aiken JP
10275159 10/1998 7,043,476 B2 5/2006 Robson JP 11184884 7/1999 7,246,128 B2 …
Data clustering, segmentation, and parallelization
A Anderson - US Patent 10,503,755, 2019 - Google Patents
(57) ABSTRACT A first set of original records is processed by a first processing entity to
generate a second set of records that includes the original records and one or more copies …
Data clustering based on variant token networks
A Anderson - US Patent 9,037,589, 2015 - Google Patents
6,493,709 B1 12/2002 Aiken JP H09-044518 2/1997 6,581,058 B1 6/2003 Fayyad et al. JP
10275159 10/1998 6,658,626 B1 12/2003 Aiken JP 11184884 7/1999 7,043,476 B2 …
Connected Components for Scaling Partial-order Blocking to Billion Entities
T Backes, S Dietze - ACM Journal of Data and Information Quality, 2024 - dl.acm.org
In entity resolution, blocking pre-partitions data for further processing by more expensive
methods. Two entity mentions are in the same block if they share identical or related …
An incremental graph-partitioning algorithm for entity resolution
Entity resolution is an important data association task when fusing information from multiple
sources. Oftentimes the information arrives continuously and the entity resolution algorithm …
Fuzzy data operations
A Anderson - US Patent 9,607,103, 2017 - Google Patents
7,283,999 B1 10/2007 Ramesh et al. JP H09-044518 2/1997 7,287,019 B2 10/2007
Kapoor et al. JP 10275159 10/1998 7,392,247 B2 6/2008 Chen et al. JP H11-003342 1 …
Adaptive temporal entity resolution on dynamic databases
Entity resolution is the process of matching records that refer to the same entities from one or
several databases in situations where the records to be matched do not include unique …