Profiling relational data: a survey

Z Abedjan, L Golab, F Naumann - The VLDB Journal, 2015‏ - Springer
Profiling data to determine metadata about a given dataset is an important and frequent
activity of any IT professional and researcher and is necessary for various use-cases. It …

Data profiling: A tutorial

Z Abedjan, L Golab, F Naumann - Proceedings of the 2017 ACM …, 2017‏ - dl.acm.org
is to understand the dataset at hand and its metadata. The process of metadata discovery is
known as data profiling. Profiling activities range from ad-hoc approaches, such as eye …

Assessing and remedying coverage for a given dataset

A Asudeh, Z **, HV Jagadish - 2019 IEEE 35th International …, 2019‏ - ieeexplore.ieee.org
Data analysis impacts virtually every aspect of our society today. Often, this analysis is
performed on an existing dataset, possibly collected through a process that the data …

Data dependencies for query optimization: a survey

J Kossmann, T Papenbrock, F Naumann - The VLDB Journal, 2022‏ - Springer
Effective query optimization is a core feature of any database management system. While
most query optimization techniques make use of simple metadata, such as cardinalities and …

Efficient discovery of approximate dependencies

S Kruse, F Naumann - Proceedings of the VLDB Endowment, 2018‏ - dl.acm.org
Functional dependencies (FDs) and unique column combinations (UCCs) form a valuable
ingredient for many data management tasks, such as data cleaning, schema recovery, and …

Data profiling with metanome

T Papenbrock, T Bergmann, M Finke… - Proceedings of the …, 2015‏ - dl.acm.org
Data profiling is the discipline of discovering metadata about given datasets. The metadata
itself serve a variety of use cases, such as data integration, data cleansing, or query …

Discovery of approximate (and exact) denial constraints

EHM Pena, EC De Almeida, F Naumann - Proceedings of the VLDB …, 2019‏ - dl.acm.org
Maintaining data consistency is known to be hard. Recent approaches have relied on
integrity constraints to deal with the problem-correct and complete constraints naturally work …

Interactive and deterministic data cleaning

J He, E Veltri, D Santoro, G Li, G Mecca… - Proceedings of the …, 2016‏ - dl.acm.org
We present Falcon, an interactive, deterministic, and declarative data cleaning system,
which uses SQL update queries as the language to repair data. Falcon does not rely on the …

Efficient denial constraint discovery with hydra

T Bleifuß, S Kruse, F Naumann - Proceedings of the VLDB Endowment, 2017‏ - dl.acm.org
Denial constraints (DCs) are a generalization of many other integrity constraints (ICs) widely
used in databases, such as key constraints, functional dependencies, or order …

Protecting data integrity of web applications with database constraints inferred from application code

H Huang, B Shen, L Zhong, Y Zhou - Proceedings of the 28th ACM …, 2023‏ - dl.acm.org
Database-backed web applications persist a large amount of production data and have high
requirements for integrity. To protect data integrity against application code bugs and …