[KNIHA][B] The data matching process

P Christen, P Christen - 2012 - Springer
This chapter provides an overview of the data matching process, and describes the five
major steps involved in this process: data pre-processing (cleaning and standardisation) …

Data-Centric Systems and Applications

MJ Carey, S Ceri, P Bernstein, U Dayal, C Faloutsos… - Italy: Springer, 2006 - Springer
The rapid growth of the Web in the past two decades has made it the largest publicly
accessible data source in the world. Web mining aims to discover useful information or …

Fuzzy data operations

A Anderson - US Patent 8,484,215, 2013 - Google Patents
US PATENT DOCUMENTS 5,179,643 A 1/1993 Homma et al. 5,388,259 A 2f1995
Fleischman et al. 5,832,182 A 11/1998 Zhang et al. 6,026,398 A 2/2000 Brown et al …

Data clustering based on candidate queries

A Anderson, K Trojan - US Patent 9,361,355, 2016 - Google Patents
6,581,058 B1 6/2003 Fayyad et al. JP HO9-044518 2, 1997 6,658,626 B1 12/2003 Aiken JP
10275159 10, 1998 7,043,476 B2 5/2006 Robson JP 11184884 7, 1999 7,246,128 B2 …

Data clustering, segmentation, and parallelization

A Anderson - US Patent 10,503,755, 2019 - Google Patents
(57) ABSTRACT A first set of original records is processed by a first processing entity to
generate a second set of records that includes the original records and one or more copies …

Data clustering based on variant token networks

A Anderson - US Patent 9,037,589, 2015 - Google Patents
6,493,709 B1 12/2002 Aiken JP HO9-044518 2, 1997 6,581,058 B1 6/2003 Fayyad et al. JP
10275159 10, 1998 6,658,626 B1 12/2003 Aiken JP 11184884 7, 1999 7,043,476 B2 …

Connected Components for Scaling Partial-order Blocking to Billion Entities

T Backes, S Dietze - ACM Journal of Data and Information Quality, 2024 - dl.acm.org
In entity resolution, blocking pre-partitions data for further processing by more expensive
methods. Two entity mentions are in the same block if they share identical or related …

An incremental graph-partitioning algorithm for entity resolution

G Tauer, K Date, R Nagi, M Sudit - Information Fusion, 2019 - Elsevier
Entity resolution is an important data association task when fusing information from multiple
sources. Oftentimes the information arrives continuously and the entity resolution algorithm …

Fuzzy data operations

A Anderson - US Patent 9,607,103, 2017 - Google Patents
7.283, 999 B1 10/2007 Ramesh et al. JP HO9-044518 2, 1997 7.287, 019 B2 10/2007
Kapoor et al. JP 10275159 10, 1998 7,392.247 B2 6, 2008 Chen et al. JP H11-003342 1 …

Adaptive temporal entity resolution on dynamic databases

P Christen, RW Gayler - Advances in Knowledge Discovery and Data …, 2013 - Springer
Entity resolution is the process of matching records that refer to the same entities from one or
several databases in situations where the records to be matched do not include unique …