Privacy-preserving record linkage for big data: Current approaches and research challenges
Abstract: The growth of Big Data, especially personal data dispersed in multiple data
sources, presents enormous opportunities and insights for businesses to explore and …
Modern privacy-preserving record linkage techniques: An overview
Record linkage is the challenging task of deciding which records, coming from disparate
data sources, refer to the same entity. Established back in 1946 by Halbert L. Dunn, the area …
Deep learning for entity matching: A design space exploration
Entity matching (EM) finds data instances that refer to the same real-world entity. In this
paper we examine applying deep learning (DL) to EM, to understand DL's benefits and …
Linking sensitive data
Sensitive personal data are created in many application domains, and there is now an
increasing demand to share, integrate, and link such data within and across organisations in …
On the accuracy and scalability of probabilistic data linkage over the Brazilian 114 million cohort
Data linkage refers to the process of identifying and linking records that refer to the same
entity across multiple heterogeneous data sources. This method has been widely utilized …
Cordel: a contrastive deep learning approach for entity linkage
Entity linkage (EL) is a critical problem in data cleaning and integration. In the past several
decades, EL has typically been done by rule-based systems or traditional machine learning …
The past, present and future of the German Record Linkage Center (GRLC)
Linking data on the same units (such as persons, enterprises or patents) is an increasingly
popular research strategy, also in the social sciences (Schnell, 2014b). Since in many cases …
Incremental clustering techniques for multi-party privacy-preserving record linkage
Abstract: Privacy-Preserving Record Linkage (PPRL) supports the integration of sensitive
information from multiple datasets, in particular the privacy-preserving matching of records …
Evaluating privacy-preserving record linkage using cryptographic long-term keys and multibit trees on large medical datasets
Background: Integrating medical data using databases from different sources by record
linkage is a powerful technique increasingly used in medical research. Under many …
Parallel Privacy-preserving Record Linkage using LSH-based Blocking
Privacy-preserving record linkage (PPRL) aims at integrating person-related data without
revealing sensitive information. For this purpose, PPRL schemes typically use encoded …