An overview of end-to-end entity resolution for big data

V Christophides, V Efthymiou, T Palpanas… - ACM Computing …, 2020 - dl.acm.org
One of the most critical tasks for improving data quality and increasing the reliability of data
analytics is Entity Resolution (ER), which aims to identify different descriptions that refer to …

Blocking and filtering techniques for entity resolution: A survey

G Papadakis, D Skoutas, E Thanos… - ACM Computing Surveys …, 2020 - dl.acm.org
Entity Resolution (ER), a core task of Data Integration, detects different entity profiles that
correspond to the same real-world object. Due to its inherently quadratic complexity, a series …

Deep learning for entity matching: A design space exploration

S Mudgal, H Li, T Rekatsinas, AH Doan… - Proceedings of the …, 2018 - dl.acm.org
Entity matching (EM) finds data instances that refer to the same real-world entity. In this
paper we examine applying deep learning (DL) to EM, to understand DL's benefits and …

[КНИГА][B] Data cleaning

IF Ilyas, X Chu - 2019 - books.google.com
This is an overview of the end-to-end data cleaning process. Data quality is one of the most
important problems in data management, since dirty data often leads to inaccurate data …

Pre-trained embeddings for entity resolution: an experimental analysis

A Zeakis, G Papadakis, D Skoutas… - Proceedings of the VLDB …, 2023 - dl.acm.org
Many recent works on Entity Resolution (ER) leverage Deep Learning techniques involving
language models to improve effectiveness. This is applied to both main steps of ER, ie …

Neural networks for entity matching: A survey

N Barlaug, JA Gulla - ACM Transactions on Knowledge Discovery from …, 2021 - dl.acm.org
Entity matching is the problem of identifying which records refer to the same real-world
entity. It has been actively researched for decades, and a variety of different approaches …

Linking sensitive data

P Christen, T Ranbaduge, R Schnell - Methods and techniques for …, 2020 - Springer
Sensitive personal data are created in many application domains, and there is now an
increasing demand to share, integrate, and link such data within and across organisations in …

A survey on blocking technology of entity resolution

BH Li, Y Liu, AM Zhang, WH Wang, S Wan - Journal of Computer Science …, 2020 - Springer
Entity resolution (ER) is a significant task in data integration, which aims to detect all entity
profiles that correspond to the same real-world entity. Due to its inherently quadratic …

Autoknow: Self-driving knowledge collection for products of thousands of types

XL Dong, X He, A Kan, X Li, Y Liang, J Ma… - Proceedings of the 26th …, 2020 - dl.acm.org
Can one build a knowledge graph (KG) for all products in the world? Knowledge graphs
have firmly established themselves as valuable sources of information for search and …

Zeroer: Entity resolution using zero labeled examples

R Wu, S Chaba, S Sawlani, X Chu… - Proceedings of the …, 2020 - dl.acm.org
Entity resolution (ER) refers to the problem of matching records in one or more relations that
refer to the same real-world entity. While supervised machine learning (ML) approaches …