Blocking and filtering techniques for entity resolution: A survey

G Papadakis, D Skoutas, E Thanos… - ACM Computing Surveys …, 2020 - dl.acm.org
Entity Resolution (ER), a core task of Data Integration, detects different entity profiles that
correspond to the same real-world object. Due to its inherently quadratic complexity, a series …

Blockchain-based privacy-preserving record linkage: enhancing data privacy in an untrusted environment

T Nóbrega, CES Pires, DC Nascimento - Information Systems, 2021 - Elsevier
Abstract Privacy-Preserving Record Linkage (PPRL) intends to integrate private data from
several data sources held by different parties. Due to recent laws and regulations (eg …

Scaling entity resolution: A loosely schema-aware approach

G Simonini, L Gagliardelli, S Bergamaschi… - Information Systems, 2019 - Elsevier
In big data sources, real-world entities are typically represented with a variety of schemata
and formats (eg, relational records, JSON objects, etc.). Different profiles (ie …

High-value token-blocking: efficient blocking method for record linkage

K O'hare, A Jurek-Loughrey… - ACM Transactions on …, 2021 - dl.acm.org
Data integration is an important component of Big Data analytics. One of the key challenges
in data integration is record linkage, that is, matching records that represent the same real …

Towards automatic privacy-preserving record linkage: A transfer learning based classification step

T Nóbrega, CES Pires, DC Nascimento… - Data & Knowledge …, 2023 - Elsevier
Abstract Privacy-Preserving Record Linkage (PPRL) intends to identify records that match
the same real-world entities across disparate data sources while preserving the privacy of …

Incremental blocking for entity resolution over web streaming data

T Brasileiro Araújo, K Stefanidis… - IEEE/WIC/ACM …, 2019 - dl.acm.org
The widespread use of information systems has become a valuable source of semi-
structured data. In this context, Entity Resolution (ER) emerges as a fundamental task to …

A noise tolerant and schema-agnostic blocking technique for entity resolution

TB Araújo, CES Pires, DG Mestre, TP Nóbrega… - Proceedings of the 34th …, 2019 - dl.acm.org
The increasing use of Web systems has become a valuable source of semi-structured data.
In this context, the Entity Resolution (ER) task emerges as a fundamental step to integrate …