Advances, challenges and opportunities in creating data for trustworthy AI
As artificial intelligence (AI) transitions from research to deployment, creating the appropriate
datasets and data pipelines to develop and evaluate AI models is increasingly the biggest …
datasets and data pipelines to develop and evaluate AI models is increasingly the biggest …
Data-centric artificial intelligence: A survey
Artificial Intelligence (AI) is making a profound impact in almost every domain. A vital enabler
of its great success is the availability of abundant and high-quality data for building machine …
of its great success is the availability of abundant and high-quality data for building machine …
Deep neural networks and tabular data: A survey
Heterogeneous tabular data are the most commonly used form of data and are essential for
numerous critical and computationally demanding applications. On homogeneous datasets …
numerous critical and computationally demanding applications. On homogeneous datasets …
Machine learning testing: Survey, landscapes and horizons
This paper provides a comprehensive survey of techniques for testing machine learning
systems; Machine Learning Testing (ML testing) research. It covers 144 papers on testing …
systems; Machine Learning Testing (ML testing) research. It covers 144 papers on testing …
A survey on data collection for machine learning: a big data-ai integration perspective
Data collection is a major bottleneck in machine learning and an active research topic in
multiple communities. There are largely two reasons data collection has recently become a …
multiple communities. There are largely two reasons data collection has recently become a …
Software engineering for machine learning: A case study
Recent advances in machine learning have stimulated widespread interest within the
Information Technology sector on integrating AI capabilities into software and services. This …
Information Technology sector on integrating AI capabilities into software and services. This …
Data collection and quality challenges in deep learning: A data-centric ai perspective
Data-centric AI is at the center of a fundamental shift in software engineering where machine
learning becomes the new software, powered by big data and computing infrastructure …
learning becomes the new software, powered by big data and computing infrastructure …
The what-if tool: Interactive probing of machine learning models
A key challenge in develo** and deploying Machine Learning (ML) systems is
understanding their performance across a wide range of inputs. To address this challenge …
understanding their performance across a wide range of inputs. To address this challenge …
Towards accountability for machine learning datasets: Practices from software engineering and infrastructure
Datasets that power machine learning are often used, shared, and reused with little visibility
into the processes of deliberation that led to their creation. As artificial intelligence systems …
into the processes of deliberation that led to their creation. As artificial intelligence systems …
Software engineering for AI-based systems: a survey
AI-based systems are software systems with functionalities enabled by at least one AI
component (eg, for image-, speech-recognition, and autonomous driving). AI-based systems …
component (eg, for image-, speech-recognition, and autonomous driving). AI-based systems …