Scalable deep learning on distributed infrastructures: Challenges, techniques, and tools

R Mayer, HA Jacobsen - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
Deep Learning (DL) has had an immense success in the recent past, leading to state-of-the-
art results in various domains, such as image recognition and natural language processing …

Data management for machine learning: A survey

C Chai, J Wang, Y Luo, Z Niu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Machine learning (ML) has widespread applications and has revolutionized many
industries, but suffers from several challenges. First, sufficient high-quality training data is …

The art and practice of data science pipelines: A comprehensive study of data science pipelines in theory, in-the-small, and in-the-large

S Biswas, M Wardat, H Rajan - … of the 44th International Conference on …, 2022 - dl.acm.org
Increasingly larger number of software systems today are including data science
components for descriptive, predictive, and prescriptive analytics. The collection of data …