Overview and importance of data quality for machine learning tasks
It is well understood from literature that the performance of a machine learning (ML) model is
upper bounded by the quality of the data. While researchers and practitioners have focused …
upper bounded by the quality of the data. While researchers and practitioners have focused …
[PDF][PDF] A formal definition of data quality problems.
The exploration of data to extract information or knowledge to support decision making is a
critical success factor for an organization in today's society. However, several problems can …
critical success factor for an organization in today's society. However, several problems can …
Data smells: Categories, causes and consequences, and detection of suspicious data in ai-based systems
High data quality is fundamental for today's AI-based systems. However, although data
quality has been an object of research for decades, there is a clear lack of research on …
quality has been an object of research for decades, there is a clear lack of research on …
Using SPARQL and SPIN for data quality management on the semantic web
The quality of data is a key factor that determines the performance of information systems, in
particular with regard (1) to the amount of exceptions in the execution of business processes …
particular with regard (1) to the amount of exceptions in the execution of business processes …
Swiqa–a semantic web information quality assessment framework
The internet is currently evolving from the" Web of Documents" into the" Web of Data" where
data is available on web-scale in the so called Semantic Web (1) to retrieve information or …
data is available on web-scale in the so called Semantic Web (1) to retrieve information or …
[PDF][PDF] A survey of data quality tools.
Data quality tools aim at detecting and correcting data problems that affect the accuracy and
efficiency of data analysis applications. We propose a classification of the most relevant …
efficiency of data analysis applications. We propose a classification of the most relevant …
[PDF][PDF] Data freshness and data accuracy: A state of the art
V Peralta - Instituto de Computacion, Facultad de Ingenieria …, 2006 - fing.edu.uy
In a context of Data Integration Systems (DIS) providing access to large amounts of data
extracted and integrated from autonomous data sources, users are highly concerned about …
extracted and integrated from autonomous data sources, users are highly concerned about …
Bigqa: Declarative big data quality assessment
In the big data domain, data quality assessment operations are often complex and must be
implementable in a distributed and timely manner. This article tries to generalize the quality …
implementable in a distributed and timely manner. This article tries to generalize the quality …
Towards a vocabulary for data quality management in semantic web architectures
Reliable decision-making and reliable information based on Semantic Web data requires
methodologies and techniques for managing the quality of the published data. To make …
methodologies and techniques for managing the quality of the published data. To make …