Blocking and filtering techniques for entity resolution: A survey

G Papadakis, D Skoutas, E Thanos… - ACM Computing Surveys …, 2020 - dl.acm.org
Entity Resolution (ER), a core task of Data Integration, detects different entity profiles that
correspond to the same real-world object. Due to its inherently quadratic complexity, a series …

Modern privacy-preserving record linkage techniques: An overview

A Gkoulalas-Divanis, D Vatsalan… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
Record linkage is the challenging task of deciding which records, coming from disparate
data sources, refer to the same entity. Established back in 1946 by Halbert L. Dunn, the area …

[LIBRO][B] The four generations of entity resolution

Entity Resolution (ER) lies at the core of data integration and cleaning and, thus, a bulk of
the research examines ways for improving its effectiveness and time efficiency. The initial …

A randomized blocking structure for streaming record linkage

D Karapiperis, C Tjortjis, VS Verykios - Proceedings of the VLDB …, 2023 - dl.acm.org
A huge amount of data, in terms of streams, are collected nowadays via a variety of sources,
such as sensors, mobile devices, or even raw log files. The unprecedented rate at which …

A survey of blocking and filtering techniques for entity resolution

G Papadakis, D Skoutas, E Thanos… - arxiv preprint arxiv …, 2019 - arxiv.org
Efficiency techniques are an integral part of Entity Resolution, since its infancy. In this
survey, we organized the bulk of works in the field into Blocking, Filtering and hybrid …

A Suite of Efficient Randomized Algorithms for Streaming Record Linkage

D Karapiperis, C Tjortjis… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Organizations leverage massive volumes of information and new types of data to generate
unprecedented insights and improve their outcomes. Correctly identifying duplicate records …

A Parallel Multi-Party Privacy-Preserving Record Linkage Method Based on a Consortium Blockchain

S Han, Z Wang, D Shen, C Wang - Mathematics, 2024 - mdpi.com
Privacy-preserving record linkage (PPRL) is the process of linking records from various data
sources, ensuring that matching records for the same entity are shared among parties while …

MultiBlock: A scalable iterative approach for progressive entity resolution

D Karapiperis, A Gkoulalas-Divanis… - … Conference on Big …, 2021 - ieeexplore.ieee.org
Progressive entity resolution techniques aim to allow linking vast amounts of records,
coming from disparate data sources, in a way that provides early access to linkage results of …

Record Linkage Approaches in Big Data: A Comprehensive Review

SF Zahrae, C Ali, A Mohamed - 2024 International Conference …, 2024 - ieeexplore.ieee.org
Analyzing data and making the right decisions have become crucial objectives in various
domains. Record linkage is one of the most important processes for guaranteeing good data …

Efficient record linkage in data streams

D Karapiperis, A Gkoulalas-Divanis… - … Conference on Big …, 2020 - ieeexplore.ieee.org
Nowadays, a vast amount of information is collected in real-time on a daily basis via users'
handheld devices, web-based applications, and customer service interactions (among many …