Data-centric artificial intelligence: A survey

D Zha, ZP Bhat, KH Lai, F Yang, Z Jiang… - ACM Computing …, 2025 - dl.acm.org
Artificial Intelligence (AI) is making a profound impact in almost every domain. A vital enabler
of its great success is the availability of abundant and high-quality data for building machine …

Data-centric ai: Perspectives and challenges

D Zha, ZP Bhat, KH Lai, F Yang, X Hu - Proceedings of the 2023 SIAM …, 2023 - SIAM
The role of data in building AI systems has recently been significantly magnified by the
emerging concept of data-centric AI (DCAI), which advocates a fundamental shift from model …

Bring your own view: Graph neural networks for link prediction with personalized subgraph selection

Q Tan, X Zhang, N Liu, D Zha, L Li, R Chen… - Proceedings of the …, 2023 - dl.acm.org
Graph neural networks (GNNs) have received remarkable success in link prediction
(GNNLP) tasks. Existing efforts first predefine the subgraph for the whole dataset and then …

Active ensemble learning for knowledge graph error detection

J Dong, Q Zhang, X Huang, Q Tan, D Zha… - Proceedings of the …, 2023 - dl.acm.org
Knowledge graphs (KGs) could effectively integrate a large number of real-world assertions,
and improve the performance of various applications, such as recommendation and search …

Surco: Learning linear surrogates for combinatorial nonlinear optimization problems

AM Ferber, T Huang, D Zha… - International …, 2023 - proceedings.mlr.press
Optimization problems with nonlinear cost functions and combinatorial constraints appear in
many real-world applications but remain challenging to solve efficiently compared to their …

Optimizing cpu performance for recommendation systems at-scale

R Jain, S Cheng, V Kalagi, V Sanghavi, S Kaul… - Proceedings of the 50th …, 2023 - dl.acm.org
Deep Learning Recommendation Models (DLRMs) are very popular in personalized
recommendation systems and are a major contributor to the data-center AI cycles. Due to the …

Rsc: accelerate graph neural networks training via randomized sparse computations

Z Liu, C Shengyuan, K Zhou, D Zha… - International …, 2023 - proceedings.mlr.press
Training graph neural networks (GNNs) is extremely time consuming because sparse graph-
based operations are hard to be accelerated by community hardware. Prior art successfully …

Collaborative graph neural networks for attributed network embedding

Q Tan, X Zhang, X Huang, H Chen… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Graph neural networks (GNNs) have shown prominent performance on attributed network
embedding. However, existing efforts mainly focus on exploiting network structures, while …

Mp-rec: Hardware-software co-design to enable multi-path recommendation

S Hsia, U Gupta, B Acun, N Ardalani, P Zhong… - Proceedings of the 28th …, 2023 - dl.acm.org
Deep learning recommendation systems serve personalized content under diverse tail-
latency targets and input-query loads. In order to do so, state-of-the-art recommendation …

Rap: Resource-aware automated gpu sharing for multi-gpu recommendation model training and input preprocessing

Z Wang, Y Wang, J Deng, D Zheng, A Li… - Proceedings of the 29th …, 2024 - dl.acm.org
Ensuring high-quality recommendations for newly onboarded users requires the continuous
retraining of Deep Learning Recommendation Models (DLRMs) with freshly generated data …