An overview of end-to-end entity resolution for big data
One of the most critical tasks for improving data quality and increasing the reliability of data
analytics is Entity Resolution (ER), which aims to identify different descriptions that refer to …
analytics is Entity Resolution (ER), which aims to identify different descriptions that refer to …
Data cleaning: Overview and emerging challenges
Detecting and repairing dirty data is one of the perennial challenges in data analytics, and
failure to do so can result in inaccurate analytics and unreliable decisions. Over the past few …
failure to do so can result in inaccurate analytics and unreliable decisions. Over the past few …
Crowdsourced data management: A survey
Any important data management and analytics tasks cannot be completely addressed by
automated processes. These tasks, such as entity resolution, sentiment analysis, and image …
automated processes. These tasks, such as entity resolution, sentiment analysis, and image …
A framework for protecting worker location privacy in spatial crowdsourcing
Spatial Crowdsourcing (SC) is a transformative platform that engages individuals, groups
and communities in the act of collecting, analyzing, and disseminating environmental, social …
and communities in the act of collecting, analyzing, and disseminating environmental, social …
The dynamics of micro-task crowdsourcing: The case of amazon mturk
Micro-task crowdsourcing is rapidly gaining popularity among research communities and
businesses as a means to leverage Human Computation in their daily operations. Unlike …
businesses as a means to leverage Human Computation in their daily operations. Unlike …
Corleone: Hands-off crowdsourcing for entity matching
Recent approaches to crowdsourcing entity matching (EM) are limited in that they
crowdsource only parts of the EM workflow, requiring a developer to execute the remaining …
crowdsource only parts of the EM workflow, requiring a developer to execute the remaining …
QASCA: A quality-aware task assignment system for crowdsourcing applications
A crowdsourcing system, such as the Amazon Mechanical Turk (AMT), provides a platform
for a large number of questions to be answered by Internet workers. Such systems have …
for a large number of questions to be answered by Internet workers. Such systems have …
A survey of general-purpose crowdsourcing techniques
Since Jeff Howe introduced the term Crowdsourcing in 2006, this human-powered problem-
solving paradigm has gained a lot of attention and has been a hot research topic in the field …
solving paradigm has gained a lot of attention and has been a hot research topic in the field …
Trends in cleaning relational data: Consistency and deduplication
Data quality is one of the most important problems in data management, since dirty data
often leads to inaccurate data analytics results and wrong business decisions. Poor data …
often leads to inaccurate data analytics results and wrong business decisions. Poor data …
Crowdsourcing algorithms for entity resolution
N Vesdapunt, K Bellare, N Dalvi - Proceedings of the VLDB Endowment, 2014 - dl.acm.org
In this paper, we study a hybrid human-machine approach for solving the problem of Entity
Resolution (ER). The goal of ER is to identify all records in a database that refer to the same …
Resolution (ER). The goal of ER is to identify all records in a database that refer to the same …