A survey on provenance: What for? What form? What from?

M Herschel, R Diestelkämper, H Ben Lahmar - The VLDB Journal, 2017 - Springer
Provenance refers to any information describing the production process of an end product,
which can be anything from a piece of digital data to a physical object. While this survey …

Query by output

QT Tran, CY Chan, S Parthasarathy - Proceedings of the 2009 ACM …, 2009 - dl.acm.org
It has recently been asserted that the usability of a database is as important as its capability.
Understanding the database schema, the hidden relationships among attributes in the data …

Data provenance

B Glavic - Foundations and Trends® in Databases, 2021 - nowpublishers.com
Data provenance has evolved from a niche topic to a mainstream area of research in
databases and other research communities. This article gives a comprehensive introduction …

Messing up with BART: error generation for evaluating data-cleaning algorithms

PC Arocena, B Glavic, G Mecca, RJ Miller… - Proceedings of the …, 2015 - dl.acm.org
We study the problem of introducing errors into clean databases for the purpose of
benchmarking data-cleaning algorithms. Our goal is to provide users with the highest …

Answering why-not questions on top-k queries

Z He, E Lo - IEEE Transactions on Knowledge and Data …, 2012 - ieeexplore.ieee.org
After decades of effort working on database performance, the quality and the usability of
database systems have received more attention in recent years. In particular, the feature of …

Efficient sampling for big provenance

S Moshtaghi Largani, S Lee - Companion Proceedings of the ACM Web …, 2023 - dl.acm.org
Provenance has been studied extensively to explain existing and missing results for many
applications while focusing on scalability and usability challenges. Recently, techniques that …

Query-oriented data cleaning with oracles

M Bergman, T Milo, S Novgorodov… - Proceedings of the 2015 …, 2015 - dl.acm.org
As key decisions are often made based on information contained in a database, it is
important for the database to be as complete and correct as possible. For this reason, many …

Query-based why-not provenance with nedexplain

N Bidoit, M Herschel, K Tzompanaki - … database technology (EDBT), 2014 - inria.hal.science
With the increasing amount of available data and transformations manipulating the data, it
has become essential to analyze and debug data transformations. A sub-problem of data …

Why not yet: Fixing a top-k ranking that is not fair to individuals

Z Chen, P Manolios, M Riedewald - Proceedings of the VLDB …, 2023 - dl.acm.org
This work considers why-not questions in the context of top-k queries and score-based
ranking functions. Following the popular linear scalarization approach for multi-objective …

Answering why-not questions on spatial keyword top-k queries

L Chen, X Lin, H Hu, CS Jensen… - 2015 IEEE 31st …, 2015 - ieeexplore.ieee.org
Large volumes of geo-tagged text objects are available on the web. Spatial keyword top-k
queries retrieve k such objects with the best score according to a ranking function that takes …