Leveraging transitive relations for crowdsourced joins
The development of crowdsourced query processing systems has recently attracted
significant attention in the database community. A variety of crowdsourced queries have …
Crowdsourcing algorithms for entity resolution
N Vesdapunt, K Bellare, N Dalvi - Proceedings of the VLDB Endowment, 2014 - dl.acm.org
In this paper, we study a hybrid human-machine approach for solving the problem of Entity
Resolution (ER). The goal of ER is to identify all records in a database that refer to the same …
Crowdsourced data management: Industry and academic perspectives
Crowdsourcing and human computation enable organizations to accomplish tasks that are
currently not possible for fully automated techniques to complete, or require more flexibility …
Crowdsourcing for data management
Crowdsourcing provides access to a pool of human workers who can contribute solutions to
tasks that are challenging for computers. Proposals have been made for the use of …
Crowd-based deduplication: An adaptive approach
Data deduplication stands as a building block for data integration and data cleaning. The
state-of-the-art techniques focus on how to exploit crowdsourcing to improve the accuracy of …
An overview of the deco system: data model and query language; query processing and optimization
Deco is a comprehensive system for answering declarative queries posed over stored
relational data together with data obtained on-demand from the crowd. In this overview …
An introduction to hybrid human-machine information systems
Hybrid Human-Machine Information Systems leverage novel architectures that
make systematic use of Human Computation by means of crowdsourcing. These …
Draining the data swamp: A similarity-based approach
While hierarchical namespaces such as filesystems and repositories have long been used
to organize data, the rapid increase in data production places increasing strain on users …
Social data analysis framework in cloud and mobility analyzer for smarter cities
One emerging and challenging issue in Smarter Cities is Mobility. In order to gather mobility
data for relevant analyses, two solutions are widely discussed, namely the conventional …
A hybrid data deduplication approach in entity resolution using chromatic correlation clustering
Entity resolution (ER) classifies records that refer to the same real-world entity and is
fundamental to data cleaning. Identifying approximate but not exact duplicates in database …