A software engineering perspective on engineering machine learning systems: State of the art and challenges

G Giray - Journal of Systems and Software, 2021 - Elsevier
Context: Advancements in machine learning (ML) lead to a shift from the traditional view of
software development, where algorithms are hard-coded by humans, to ML systems …

AUC maximization in the era of big data and AI: A survey

T Yang, Y Ying - ACM Computing Surveys, 2022 - dl.acm.org
Area under the ROC curve, aka AUC, is a measure of choice for assessing the performance
of a classifier for imbalanced data. AUC maximization refers to a learning paradigm that …

DeepFM: a factorization-machine based neural network for CTR prediction

H Guo, R Tang, Y Ye, Z Li, X He - arxiv preprint arxiv:1703.04247, 2017 - arxiv.org
Learning sophisticated feature interactions behind user behaviors is critical in maximizing
CTR for recommender systems. Despite great progress, existing methods seem to have a …

Deep interest network for click-through rate prediction

G Zhou, X Zhu, C Song, Y Fan, H Zhu, X Ma… - Proceedings of the 24th …, 2018 - dl.acm.org
Click-through rate prediction is an essential task in industrial applications, such as online
advertising. Recently deep learning based models have been proposed, which follow a …

xdeepfm: Combining explicit and implicit feature interactions for recommender systems

J Lian, X Zhou, F Zhang, Z Chen, X **e… - Proceedings of the 24th …, 2018 - dl.acm.org
Combinatorial features are essential for the success of many commercial models. Manually
crafting these features usually comes with high cost due to the variety, volume and velocity …

Scaling distributed machine learning with the parameter server

M Li, DG Andersen, JW Park, AJ Smola… - … USENIX Symposium on …, 2014 - usenix.org
We propose a parameter server framework for distributed machine learning problems. Both
data and workloads are distributed over worker nodes, while the server nodes maintain …

Autoint: Automatic feature interaction learning via self-attentive neural networks

W Song, C Shi, Z **ao, Z Duan, Y Xu, M Zhang… - Proceedings of the 28th …, 2019 - dl.acm.org
Click-through rate (CTR) prediction, which aims to predict the probability of a user clicking
on an ad or an item, is critical to many online applications such as online advertising and …

Hidden technical debt in machine learning systems

D Sculley, G Holt, D Golovin… - Advances in neural …, 2015 - proceedings.neurips.cc
Abstract Machine learning offers a fantastically powerful toolkit for building useful
complexprediction systems quickly. This paper argues it is dangerous to think ofthese quick …

When do neural nets outperform boosted trees on tabular data?

D McElfresh, S Khandagale… - Advances in …, 2024 - proceedings.neurips.cc
Tabular data is one of the most commonly used types of data in machine learning. Despite
recent advances in neural nets (NNs) for tabular data, there is still an active discussion on …

Visual analytics in deep learning: An interrogative survey for the next frontiers

F Hohman, M Kahng, R Pienta… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
Deep learning has recently seen rapid development and received significant attention due
to its state-of-the-art performance on previously-thought hard problems. However, because …