Leveraging transitive relations for crowdsourced joins

J Wang, G Li, T Kraska, MJ Franklin… - Proceedings of the 2013 …, 2013 - dl.acm.org
The development of crowdsourced query processing systems has recently attracted a
significant attention in the database community. A variety of crowdsourced queries have …

Crowdsourcing algorithms for entity resolution

N Vesdapunt, K Bellare, N Dalvi - Proceedings of the VLDB Endowment, 2014 - dl.acm.org
In this paper, we study a hybrid human-machine approach for solving the problem of Entity
Resolution (ER). The goal of ER is to identify all records in a database that refer to the same …

Crowdsourced data management: Industry and academic perspectives

A Marcus, A Parameswaran - Foundations and Trends® in …, 2015 - nowpublishers.com
Crowdsourcing and human computation enable organizations to accomplish tasks that are
currently not possible for fully automated techniques to complete, or require more flexibility …

Crowdsourcing for data management

V Crescenzi, AAA Fernandes, P Merialdo… - … and Information Systems, 2017 - Springer
Crowdsourcing provides access to a pool of human workers who can contribute solutions to
tasks that are challenging for computers. Proposals have been made for the use of …

Crowd-based deduplication: An adaptive approach

S Wang, X **ao, CH Lee - Proceedings of the 2015 ACM SIGMOD …, 2015 - dl.acm.org
Data deduplication stands as a building block for data integration and data cleaning. The
state-of-the-art techniques focus on how to exploit crowdsourcing to improve the accuracy of …

An overview of the deco system: data model and query language; query processing and optimization

H Park, R Pang, A Parameswaran… - ACM SIGMOD …, 2013 - dl.acm.org
Deco is a comprehensive system for answering declarative queries posed over stored
relational data together with data obtained on-demand from the crowd. In this overview …

An introduction to hybrid human-machine information systems

G Demartini, DE Difallah, U Gadiraju… - … and Trends® in Web …, 2017 - nowpublishers.com
Abstract Hybrid Human-Machine Information Systems leverage novel architectures that
make systematic use of Human Computation by means of crowdsourcing. These …

Draining the data swamp: A similarity-based approach

W Brackenbury, R Liu, M Mondal, AJ Elmore… - Proceedings of the …, 2018 - dl.acm.org
While hierarchical namespaces such as filesystems and repositories have long been used
to organize data, the rapid increase in data production places increasing strain on users …

Social data analysis framework in cloud and mobility analyzer for smarter cities

L You, G Motta, D Sacco, T Ma - Proceedings of 2014 IEEE …, 2014 - ieeexplore.ieee.org
One emerging and challenging issue in Smarter Cities is Mobility. In order to gather mobility
data for relevant analyses, two solutions are widely discussed, namely the conventional …

A hybrid data deduplication approach in entity resolution using chromatic correlation clustering

CR Haruna, M Hou, MJ Eghan, MY Kpiebaareh… - Frontiers in Cyber …, 2018 - Springer
Entity resolution (ER) classifies records that refer to the same real-world entity and is
fundamental to data cleaning. Identifying approximate but not exact duplicates in database …