An overview of end-to-end entity resolution for big data

V Christophides, V Efthymiou, T Palpanas… - ACM Computing …, 2020 - dl.acm.org
One of the most critical tasks for improving data quality and increasing the reliability of data
analytics is Entity Resolution (ER), which aims to identify different descriptions that refer to …

[LIBRO][B] The four generations of entity resolution

Entity Resolution (ER) lies at the core of data integration and cleaning and, thus, a bulk of
the research examines ways for improving its effectiveness and time efficiency. The initial …

Using link features for entity clustering in knowledge graphs

A Saeedi, E Peukert, E Rahm - European Semantic Web Conference, 2018 - Springer
Abstract Knowledge graphs holistically integrate information about entities from multiple
sources. A key step in the construction and maintenance of knowledge graphs is the …

End-to-end entity resolution for big data: A survey

V Christophides, V Efthymiou, T Palpanas… - arxiv preprint arxiv …, 2019 - arxiv.org
One of the most important tasks for improving data quality and the reliability of data analytics
results is Entity Resolution (ER). ER aims to identify different descriptions that refer to the …

The case for holistic data integration

E Rahm - Advances in Databases and Information Systems: 20th …, 2016 - Springer
Current data integration approaches are mostly limited to few data sources, partly due to the
use of binary match approaches between pairs of sources. We thus advocate for the …

Comparative evaluation of distributed clustering schemes for multi-source entity resolution

A Saeedi, E Peukert, E Rahm - … 2017, Nicosia, Cyprus, September 24-27 …, 2017 - Springer
Entity resolution identifies semantically equivalent entities, eg, describing the same product
or customer. It is especially challenging for big data applications where large volumes of …

Scalable matching and clustering of entities with FAMER

A Saeedi, M Nentwig, E Peukert… - … Systems Informatics and …, 2018 - journals.rtu.lv
Entity resolution identifies semantically equivalent entities, eg describing the same product
or customer. It is especially challenging for Big Data applications where large volumes of …

Transforming pairwise duplicates to entity clusters for high-quality duplicate detection

U Draisbach, P Christen, F Naumann - Journal of Data and Information …, 2019 - dl.acm.org
Duplicate detection algorithms produce clusters of database records, each cluster
representing a single real-world entity. As most of these algorithms use pairwise …

DBpedia FlexiFusion the best of Wikipedia> Wikidata> your data

J Frey, M Hofer, D Obraczka, J Lehmann… - The Semantic Web …, 2019 - Springer
The data quality improvement of DBpedia has been in the focus of many publications in the
past years with topics covering both knowledge enrichment techniques such as type …

Incremental clustering on linked data

M Nentwig, E Rahm - 2018 IEEE International Conference on …, 2018 - ieeexplore.ieee.org
Data integration in the Web of Data is not limited to the pairwise linking of entities but often
requires to cluster entities of different sources, eg, within knowledge graphs. Such entity …