Machine learning for ultrasonic nondestructive examination of welding defects: A systematic review

H Sun, P Ramuhalli, RE Jacob - Ultrasonics, 2023 - Elsevier
Recent years have seen a substantial increase in the application of machine learning (ML)
for automated analysis of nondestructive examination (NDE) data. One of the applications of …

Opportunities and challenges in data-centric AI

S Kumar, S Datta, V Singh, SK Singh, R Sharma - IEEE Access, 2024 - ieeexplore.ieee.org
Artificial intelligence (AI) systems are trained to solve complex problems and learn to
perform specific tasks by using large volumes of data, such as prediction, classification …

Data collection and quality challenges in deep learning: A data-centric ai perspective

SE Whang, Y Roh, H Song, JG Lee - The VLDB Journal, 2023 - Springer
Data-centric AI is at the center of a fundamental shift in software engineering where machine
learning becomes the new software, powered by big data and computing infrastructure …

The effects of data quality on machine learning performance

L Budach, M Feuerpfeil, N Ihde, A Nathansen… - arxiv preprint arxiv …, 2022 - arxiv.org
Modern artificial intelligence (AI) applications require large quantities of training and test
data. This need creates critical challenges not only concerning the availability of such data …

A benchmark for data imputation methods

S Jäger, A Allhorn, F Bießmann - Frontiers in big Data, 2021 - frontiersin.org
With the increasing importance and complexity of data pipelines, data quality became one of
the key challenges in modern software applications. The importance of data quality has …

[HTML][HTML] A hybrid machine learning approach for the load prediction in the sustainable transition of district heating networks

M Habib, TO Timoudas, Y Ding, N Nord, S Chen… - Sustainable cities and …, 2023 - Elsevier
Current district heating networks are undergoing a sustainable transition towards the 4 th
and 5 th generation of district heating networks, characterized by the integration of different …

Data smells: categories, causes and consequences, and detection of suspicious data in AI-based systems

H Foidl, M Felderer, R Ramler - … of the 1st International Conference on …, 2022 - dl.acm.org
High data quality is fundamental for today's AI-based systems. However, although data
quality has been an object of research for decades, there is a clear lack of research on …

[HTML][HTML] Data pipeline quality: Influencing factors, root causes of data-related issues, and processing problem areas for developers

H Foidl, V Golendukhina, R Ramler… - Journal of Systems and …, 2024 - Elsevier
Data pipelines are an integral part of various modern data-driven systems. However, despite
their importance, they are often unreliable and deliver poor-quality data. A critical step …

Exploring needs and challenges for AI in nursing care–results of an explorative sequential mixed methods study

K Seibert, D Domhoff, D Fürstenau, F Biessmann… - BMC Digital Health, 2023 - Springer
Background and aim While artificial intelligence (AI) is being adapted for various life
domains and applications related to medicine and healthcare, the use of AI in nursing …

Automatic and precise data validation for machine learning

S Shankar, L Fawaz, K Gyllstrom… - Proceedings of the 32nd …, 2023 - dl.acm.org
Machine learning (ML) models in production pipelines are frequently retrained on the latest
partitions of large, continually-growing datasets. Due to engineering bugs, partitions in such …