Data cleaning: Overview and emerging challenges

X Chu, IF Ilyas, S Krishnan, J Wang - Proceedings of the 2016 …, 2016 - dl.acm.org
Detecting and repairing dirty data is one of the perennial challenges in data analytics, and
failure to do so can result in inaccurate analytics and unreliable decisions. Over the past few …

[KÖNYV][B] Data cleaning

IF Ilyas, X Chu - 2019 - books.google.com
This is an overview of the end-to-end data cleaning process. Data quality is one of the most
important problems in data management, since dirty data often leads to inaccurate data …

[KÖNYV][B] Database repairing and consistent query answering

L Bertossi - 2011 - books.google.com
Integrity constraints are semantic conditions that a database should satisfy in order to be an
appropriate model of external reality. In practice, and for many reasons, a database may not …

Trends in cleaning relational data: Consistency and deduplication

IF Ilyas, X Chu - Foundations and Trends® in Databases, 2015 - nowpublishers.com
Data quality is one of the most important problems in data management, since dirty data
often leads to inaccurate data analytics results and wrong business decisions. Poor data …

Towards dependable data repairing with fixing rules

J Wang, N Tang - Proceedings of the 2014 ACM SIGMOD international …, 2014 - dl.acm.org
One of the main challenges that data cleaning systems face is to automatically identify and
repair data errors in a dependable manner. Though data dependencies (aka integrity …

Interaction between record matching and data repairing

W Fan, S Ma, N Tang, W Yu - Journal of Data and Information Quality …, 2014 - dl.acm.org
Central to a data cleaning system are record matching and data repairing. Matching aims to
identify tuples that refer to the same real-world object, and repairing is to make a database …

[PDF][PDF] 大数据的-个重要方面 数据可用性

**建中, 刘显敏 - 计算机研究与发展, 2013 - cs.sjtu.edu.cn
摘要!"# $% &'()*+,-.# $/0 123 4567893:;% &'<=>?@ ABCDEF GFHI# $8 J'KLMN
OPQRSTU@'VWIABXYZ [\],@ AB'KLVW^ _I!" AB'aZbc deABQ!^ fS ABXYZghiKjk l# $8 J …

Uncertain entity resolution: re-evaluating entity resolution in the big data era: tutorial

A Gal - Proceedings of the VLDB Endowment, 2014 - dl.acm.org
Entity resolution is a fundamental problem in data integration dealing with the combination
of data from different sources to a unified view of the data. Entity resolution is inherently an …

Qualitative data cleaning

X Chu, IF Ilyas - Proceedings of the VLDB Endowment, 2016 - dl.acm.org
Data quality is one of the most important problems in data management, since dirty data
often leads to inaccurate data analytics results and wrong business decisions. Data cleaning …

Database repairs and consistent query answering: Origins and further developments

L Bertossi - Proceedings of the 38th ACM SIGMOD-SIGACT-SIGAI …, 2019 - dl.acm.org
In this article we review the main concepts around database repairs and consistent query
answering, with emphasis on tracing back the origin, motivation, and early developments …