Data quality challenges in large-scale cyber-physical systems: A systematic review

J Bogner, R Verdecchia… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org
Background: With the rising popularity of Artificial Intelligence (AI), there is a growing need to
build large and complex AI-based systems in a cost-effective and manageable way. Like …

Data smells: categories, causes and consequences, and detection of suspicious data in AI-based systems

H Foidl, M Felderer, R Ramler - … of the 1st International Conference on …, 2022 - dl.acm.org
High data quality is fundamental for today's AI-based systems. However, although data
quality has been an object of research for decades, there is a clear lack of research on …

BIGOWL4DQ: Ontology-driven approach for Big Data quality meta-modelling, selection and reasoning

C Barba-González, I Caballero, ÁJ Varela-Vaca… - Information and …, 2024 - Elsevier
Context: Data quality should be at the core of many Artificial Intelligence initiatives from the
very first moment in which data is required for a successful analysis. Measurement and …

DQLearn: A toolkit for structured data quality learning

S Shrivastava, D Patel, N Zhou… - … Conference on Big …, 2020 - ieeexplore.ieee.org
Data Quality (DQ) has been one of the key focuses as Data Analytics and Artificial
Intelligence (AI) fields continue to grow. Yet, data quality analysis has mostly been a …

Enhancing data preparation: insights from a time series case study

C Sancricca, G Siracusa, C Cappiello - Journal of Intelligent Information …, 2024 - Springer
Data play a key role in AI systems that support decision-making processes. Data-centric AI
highlights the importance of having high-quality input data to obtain reliable results …

Interactive data cleaning for real-time streaming applications

T Räth, N Onah, KU Sattler - Proceedings of the Workshop on Human-In …, 2023 - dl.acm.org
The importance of data cleaning systems has continuously grown in recent years. Especially
for real-time streaming applications, it is crucial, to identify and possibly remove anomalies …

Data Quality Management in Large-Scale Cyber-Physical Systems

A Alwan - 2021 - repository.uel.ac.uk
Cyber-Physical Systems (CPSs) are cross-domain, multi-model, advance information
systems that play a significant role in many large-scale infrastructure sectors of smart cities …

DQDF: data-quality-aware dataframes

P Sinthong, D Patel, N Zhou, S Shrivastava… - Proceedings of the …, 2021 - dl.acm.org
Data quality assessment is an essential process of any data analysis process including
machine learning. The process is time-consuming as it involves multiple independent data …

IML4DQ: Interactive Machine Learning for Data Quality with Applications in Credit Risk

E Tiukhova, A Salcuni, C Oguz, F Forte… - International Conference …, 2024 - Springer
Data Quality (DQ) has gained popularity in recent years due to the increasing reliance on
data in machine learning (ML). The DQ domain itself can benefit from ML, which is able to …