An overview of end-to-end entity resolution for big data

V Christophides, V Efthymiou, T Palpanas… - ACM Computing …, 2020 - dl.acm.org
One of the most critical tasks for improving data quality and increasing the reliability of data
analytics is Entity Resolution (ER), which aims to identify different descriptions that refer to …

Data cleaning: Overview and emerging challenges

X Chu, IF Ilyas, S Krishnan, J Wang - Proceedings of the 2016 …, 2016 - dl.acm.org
Detecting and repairing dirty data is one of the perennial challenges in data analytics, and
failure to do so can result in inaccurate analytics and unreliable decisions. Over the past few …

Truth inference in crowdsourcing: Is the problem solved?

Y Zheng, G Li, Y Li, C Shan, R Cheng - Proceedings of the VLDB …, 2017 - dl.acm.org
Crowdsourcing has emerged as a novel problem-solving paradigm, which facilitates
addressing problems that are hard for computers, e.g., entity resolution and sentiment …

Neural networks for entity matching: A survey

N Barlaug, JA Gulla - ACM Transactions on Knowledge Discovery from …, 2021 - dl.acm.org
Entity matching is the problem of identifying which records refer to the same real-world
entity. It has been actively researched for decades, and a variety of different approaches …

Crowdsourced data management: A survey

G Li, J Wang, Y Zheng… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
Many important data management and analytics tasks cannot be completely addressed by
automated processes. These tasks, such as entity resolution, sentiment analysis, and image …

Corleone: Hands-off crowdsourcing for entity matching

C Gokhale, S Das, AH Doan, JF Naughton… - Proceedings of the …, 2014 - dl.acm.org
Recent approaches to crowdsourcing entity matching (EM) are limited in that they
crowdsource only parts of the EM workflow, requiring a developer to execute the remaining …

iCrowd: An adaptive crowdsourcing framework

J Fan, G Li, BC Ooi, K Tan, J Feng - Proceedings of the 2015 ACM …, 2015 - dl.acm.org
Crowdsourcing is widely accepted as a means for resolving tasks that machines are not
good at. Unfortunately, crowdsourcing may yield relatively low-quality results if there is no …

QASCA: A quality-aware task assignment system for crowdsourcing applications

Y Zheng, J Wang, G Li, R Cheng, J Feng - Proceedings of the 2015 ACM …, 2015 - dl.acm.org
A crowdsourcing system, such as the Amazon Mechanical Turk (AMT), provides a platform
for a large number of questions to be answered by Internet workers. Such systems have …

A survey of general-purpose crowdsourcing techniques

AI Chittilappilly, L Chen… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
Since Jeff Howe introduced the term Crowdsourcing in 2006, this human-powered problem-
solving paradigm has gained a lot of attention and has been a hot research topic in the field …

Leveraging transitive relations for crowdsourced joins

J Wang, G Li, T Kraska, MJ Franklin… - Proceedings of the 2013 …, 2013 - dl.acm.org
The development of crowdsourced query processing systems has recently attracted
significant attention in the database community. A variety of crowdsourced queries have …