Overview and importance of data quality for machine learning tasks

A Jain, H Patel, L Nagalapatti, N Gupta… - Proceedings of the 26th …, 2020 - dl.acm.org
It is well understood from literature that the performance of a machine learning (ML) model is
upper bounded by the quality of the data. While researchers and practitioners have focused …

[PDF][PDF] A formal definition of data quality problems.

P Oliveira, F Rodrigues, PR Henriques - ICIQ, 2005 - researchgate.net
The exploration of data to extract information or knowledge to support decision making is a
critical success factor for an organization in today's society. However, several problems can …

Data smells: Categories, causes and consequences, and detection of suspicious data in ai-based systems

H Foidl, M Felderer, R Ramler - … of the 1st International Conference on …, 2022 - dl.acm.org
High data quality is fundamental for today's AI-based systems. However, although data
quality has been an object of research for decades, there is a clear lack of research on …

Using SPARQL and SPIN for data quality management on the semantic web

C Fürber, M Hepp - … Systems: 13th International Conference, BIS 2010 …, 2010 - Springer
The quality of data is a key factor that determines the performance of information systems, in
particular with regard (1) to the amount of exceptions in the execution of business processes …

Swiqa–a semantic web information quality assessment framework

C Fürber, M Hepp - 2011 - aisel.aisnet.org
The internet is currently evolving from the" Web of Documents" into the" Web of Data" where
data is available on web-scale in the so called Semantic Web (1) to retrieve information or …

[PDF][PDF] A survey of data quality tools.

J Barateiro, H Galhardas - Datenbank-Spektrum, 2005 - Citeseer
Data quality tools aim at detecting and correcting data problems that affect the accuracy and
efficiency of data analysis applications. We propose a classification of the most relevant …

[書籍][B] Semantic technologies

C Fürber, C Fürber - 2016 - Springer
As discussed in section 2.1 of this thesis we regard semantic technologies “as technical
approaches that facilitate or make use of the interpretation of meaning by machines” …

[PDF][PDF] Data freshness and data accuracy: A state of the art

V Peralta - Instituto de Computacion, Facultad de Ingenieria …, 2006 - fing.edu.uy
In a context of Data Integration Systems (DIS) providing access to large amounts of data
extracted and integrated from autonomous data sources, users are highly concerned about …

Bigqa: Declarative big data quality assessment

H Fadlallah, R Kilany, H Dhayne, R El Haddad… - ACM Journal of Data …, 2023 - dl.acm.org
In the big data domain, data quality assessment operations are often complex and must be
implementable in a distributed and timely manner. This article tries to generalize the quality …

Towards a vocabulary for data quality management in semantic web architectures

C Fürber, M Hepp - Proceedings of the 1st International Workshop on …, 2011 - dl.acm.org
Reliable decision-making and reliable information based on Semantic Web data requires
methodologies and techniques for managing the quality of the published data. To make …