Data-centric artificial intelligence: A survey

D Zha, ZP Bhat, KH Lai, F Yang, Z Jiang… - ACM Computing …, 2025 - dl.acm.org
Artificial Intelligence (AI) is making a profound impact in almost every domain. A vital enabler
of its great success is the availability of abundant and high-quality data for building machine …

Knowledge graphs

A Hogan, E Blomqvist, M Cochez, C d'Amato… - ACM Computing …, 2021 - dl.acm.org
In this article, we provide a comprehensive introduction to knowledge graphs, which have
recently garnered significant attention from both industry and academia in scenarios that …

“Everyone wants to do the model work, not the data work”: Data Cascades in High-Stakes AI

N Sambasivan, S Kapania, H Highfill… - Proceedings of the …, 2021 - dl.acm.org
AI models are increasingly applied in high-stakes domains like health and conservation.
Data quality carries an elevated significance in high-stakes AI due to its heightened …

A survey on data collection for machine learning: a big data-AI integration perspective

Y Roh, G Heo, SE Whang - IEEE Transactions on Knowledge …, 2019 - ieeexplore.ieee.org
Data collection is a major bottleneck in machine learning and an active research topic in
multiple communities. There are largely two reasons data collection has recently become a …

Datasheets for datasets

T Gebru, J Morgenstern, B Vecchione… - Communications of the …, 2021 - dl.acm.org
Communications of the ACM, December 2021, Vol. 64, No. 12, p. 86 (review article). Data plays a
critical role in machine learning. Every …

Data lake management: challenges and opportunities

F Nargesian, E Zhu, RJ Miller, KQ Pu… - Proceedings of the VLDB …, 2019 - dl.acm.org
The ubiquity of data lakes has created fascinating new challenges for data management
research. In this tutorial, we review the state-of-the-art in data management for data lakes …

How do data science workers collaborate? roles, workflows, and tools

AX Zhang, M Muller, D Wang - Proceedings of the ACM on Human …, 2020 - dl.acm.org
Today, the prominence of data science within organizations has given rise to teams of data
science workers collaborating on extracting insights from data, as opposed to individual data …

Data science: a comprehensive overview

L Cao - ACM Computing Surveys (CSUR), 2017 - dl.acm.org
The 21st century has ushered in the age of big data and data economy, in which data DNA,
which carries important knowledge, insights, and potential, has become an intrinsic …

The role of big data analytics in Internet of Things

E Ahmed, I Yaqoob, IAT Hashem, I Khan, AIA Ahmed… - Computer Networks, 2017 - Elsevier
The explosive growth in the number of devices connected to the Internet of Things (IoT) and
the exponential increase in data consumption only reflect how the growth of big data …

Automated machine learning: State-of-the-art and open challenges

R Elshawi, M Maher, S Sakr - arXiv preprint arXiv:1906.02287, 2019 - arxiv.org
With the continuous and vast increase in the amount of data in our digital world, it has been
acknowledged that the number of knowledgeable data scientists cannot scale to address …