Fairness in machine learning: A survey

S Caton, C Haas - ACM Computing Surveys, 2024 - dl.acm.org
When Machine Learning technologies are used in contexts that affect citizens, companies as
well as researchers need to be confident that there will not be any unexpected social …

A survey of computer-aided tumor diagnosis based on convolutional neural network

Y Yan, XJ Yao, SH Wang, YD Zhang - Biology, 2021 - mdpi.com
Simple Summary One of the hottest areas in deep learning is computerized tumor diagnosis
and treatment. The identification of tumor markers, the outline of tumor growth activity, and …

[PDF][PDF] Tree boosting with xgboost-why does xgboost win" every" machine learning competition?

D Nielsen - 2016 - ntnuopen.ntnu.no
Tree boosting has empirically proven to be a highly effective approach to predictive
modeling. It has shown remarkable results for a vast array of problems. For many years …

A group-theoretic framework for data augmentation

S Chen, E Dobriban, JH Lee - Journal of Machine Learning Research, 2020 - jmlr.org
Data augmentation is a widely used trick when training deep neural networks: in addition to
the original data, properly transformed data are also added to the training set. However, to …

Supersparse linear integer models for optimized medical scoring systems

B Ustun, C Rudin - Machine Learning, 2016 - Springer
Scoring systems are linear classification models that only require users to add, subtract and
multiply a few small numbers in order to make a prediction. These models are in widespread …

The landscape of empirical risk for nonconvex losses

S Mei, Y Bai, A Montanari - The Annals of Statistics, 2018 - JSTOR
Most high-dimensional estimation methods propose to minimize a cost function (empirical
risk) that is a sum of losses associated to each data point (each example). In this paper, we …

Who should predict? exact algorithms for learning to defer to humans

H Mozannar, H Lang, D Wei… - International …, 2023 - proceedings.mlr.press
Automated AI classifiers should be able to defer the prediction to a human decision maker to
ensure more accurate predictions. In this work, we jointly train a classifier with a rejector …

Differentiable ranking and sorting using optimal transport

M Cuturi, O Teboul, JP Vert - Advances in neural …, 2019 - proceedings.neurips.cc
Sorting is used pervasively in machine learning, either to define elementary algorithms, such
as $ k $-nearest neighbors ($ k $-NN) rules, or to define test-time metrics, such as top-$ k …

Predictive multiplicity in classification

C Marx, F Calmon, B Ustun - International Conference on …, 2020 - proceedings.mlr.press
Prediction problems often admit competing models that perform almost equally well. This
effect challenges key assumptions in machine learning when competing models assign …

Development and validation of a machine learning model to identify patients before surgery at high risk for postoperative adverse events

A Mahajan, S Esper, TH Oo, J McKibben… - JAMA Network …, 2023 - jamanetwork.com
Importance Identifying patients at high risk of adverse outcomes prior to surgery may allow
for interventions associated with improved postoperative outcomes; however, few tools exist …