Advances, challenges and opportunities in creating data for trustworthy AI

W Liang, GA Tadesse, D Ho, L Fei-Fei… - Nature Machine …, 2022 - nature.com
As artificial intelligence (AI) transitions from research to deployment, creating the appropriate
datasets and data pipelines to develop and evaluate AI models is increasingly the biggest …

Advanced battery management strategies for a sustainable energy future: Multilayer design concepts and research trends

H Dai, B Jiang, X Hu, X Lin, X Wei, M Pecht - Renewable and Sustainable …, 2021 - Elsevier
Lithium-ion batteries are promising energy storage devices for electric vehicles and
renewable energy systems. However, due to complex electrochemical processes, potential …

“Everyone wants to do the model work, not the data work”: Data Cascades in High-Stakes AI

N Sambasivan, S Kapania, H Highfill… - Proceedings of the …, 2021 - dl.acm.org
AI models are increasingly applied in high-stakes domains like health and conservation.
Data quality carries an elevated significance in high-stakes AI due to its heightened …

Data collection and quality challenges in deep learning: A data-centric AI perspective

SE Whang, Y Roh, H Song, JG Lee - The VLDB Journal, 2023 - Springer
Data-centric AI is at the center of a fundamental shift in software engineering where machine
learning becomes the new software, powered by big data and computing infrastructure …

Machine learning testing: Survey, landscapes and horizons

JM Zhang, M Harman, L Ma… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
This paper provides a comprehensive survey of techniques for testing machine learning
systems; Machine Learning Testing (ML testing) research. It covers 144 papers on testing …

A survey on data collection for machine learning: a big data-AI integration perspective

Y Roh, G Heo, SE Whang - IEEE Transactions on Knowledge …, 2019 - ieeexplore.ieee.org
Data collection is a major bottleneck in machine learning and an active research topic in
multiple communities. There are largely two reasons data collection has recently become a …

AutoML to date and beyond: Challenges and opportunities

SK Karmaker, MM Hassan, MJ Smith, L Xu… - ACM Computing …, 2021 - dl.acm.org
As big data becomes ubiquitous across domains, and more and more stakeholders aspire to
make the most of their data, demand for machine learning tools has spurred researchers to …

Benchmark and survey of automated machine learning frameworks

MA Zöller, MF Huber - Journal of Artificial Intelligence Research, 2021 - jair.org
Machine learning (ML) has become a vital part in many aspects of our daily life.
However, building well performing machine learning applications requires highly …

[BOOK][B] Data cleaning

IF Ilyas, X Chu - 2019 - books.google.com
This is an overview of the end-to-end data cleaning process. Data quality is one of the most
important problems in data management, since dirty data often leads to inaccurate data …

TFX: A TensorFlow-based production-scale machine learning platform

D Baylor, E Breck, HT Cheng, N Fiedel… - Proceedings of the 23rd …, 2017 - dl.acm.org
Creating and maintaining a platform for reliably producing and deploying machine learning
models requires careful orchestration of many components: a learner for generating …