Data-centric artificial intelligence: A survey

D Zha, ZP Bhat, KH Lai, F Yang, Z Jiang… - ACM Computing …, 2025 - dl.acm.org
Artificial Intelligence (AI) is making a profound impact in almost every domain. A vital enabler
of its great success is the availability of abundant and high-quality data for building machine …

Knowledge graphs

A Hogan, E Blomqvist, M Cochez, C d'Amato… - ACM Computing …, 2021 - dl.acm.org
In this article, we provide a comprehensive introduction to knowledge graphs, which have
recently garnered significant attention from both industry and academia in scenarios that …

“Everyone wants to do the model work, not the data work”: Data Cascades in High-Stakes AI

N Sambasivan, S Kapania, H Highfill… - proceedings of the …, 2021 - dl.acm.org
AI models are increasingly applied in high-stakes domains like health and conservation.
Data quality carries an elevated significance in high-stakes AI due to its heightened …

A survey on data collection for machine learning: a big data-ai integration perspective

Y Roh, G Heo, SE Whang - IEEE Transactions on Knowledge …, 2019 - ieeexplore.ieee.org
Data collection is a major bottleneck in machine learning and an active research topic in
multiple communities. There are largely two reasons data collection has recently become a …

Datasheets for datasets

T Gebru, J Morgenstern, B Vecchione… - Communications of the …, 2021 - dl.acm.org
Datasheets for datasets Page 1 86 COMMUNICATIONS OF THE ACM | DECEMBER 2021 |
VOL. 64 | NO. 12 review articles DATA PLAYS A critical role in machine learning. Every …

How do data science workers collaborate? roles, workflows, and tools

AX Zhang, M Muller, D Wang - Proceedings of the ACM on Human …, 2020 - dl.acm.org
Today, the prominence of data science within organizations has given rise to teams of data
science workers collaborating on extracting insights from data, as opposed to individual data …

The role of big data analytics in Internet of Things

E Ahmed, I Yaqoob, IAT Hashem, I Khan, AIA Ahmed… - Computer Networks, 2017 - Elsevier
The explosive growth in the number of devices connected to the Internet of Things (IoT) and
the exponential increase in data consumption only reflect how the growth of big data …

Data science: a comprehensive overview

L Cao - ACM Computing Surveys (CSUR), 2017 - dl.acm.org
The 21st century has ushered in the age of big data and data economy, in which data DNA,
which carries important knowledge, insights, and potential, has become an intrinsic …

Data lake management: challenges and opportunities

F Nargesian, E Zhu, RJ Miller, KQ Pu… - Proceedings of the VLDB …, 2019 - dl.acm.org
The ubiquity of data lakes has created fascinating new challenges for data management
research. In this tutorial, we review the state-of-the-art in data management for data lakes …

How ai developers overcome communication challenges in a multidisciplinary team: A case study

D Piorkowski, S Park, AY Wang, D Wang… - Proceedings of the …, 2021 - dl.acm.org
The development of AI applications is a multidisciplinary effort, involving multiple roles
collaborating with the AI developers, an umbrella term we use to include data scientists and …