[HTML][HTML] Missing value imputation affects the performance of machine learning: A review and analysis of the literature (2010–2021)

MK Hasan, MA Alam, S Roy, A Dutta, MT Jawad… - Informatics in Medicine …, 2021 - Elsevier
Recently, numerous studies have been conducted on Missing Value Imputation (MVI),
intending the primary solution scheme for the datasets containing one or more missing …

Generative models for tabular data: A review

DK Kim, DH Ryu, Y Lee, DH Choi - Journal of Mechanical Science and …, 2024 - Springer
Generative design refers to a methodology that not only simulates the characteristics of a
given data or system but also creates artificial data for various purposes. It'sa significant …

Online semi-supervised learning with mix-typed streaming features

D Wu, S Zhuo, Y Wang, Z Chen, Y He - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Online learning with feature spaces that are not fixed but can vary over time renders a
seemingly flexible learning paradigm thus has drawn much attention. Unfortunately, two …

ECMO PAL: using deep neural networks for survival prediction in venoarterial extracorporeal membrane oxygenation

AF Stephens, M Šeman, A Diehl, D Pilcher… - Intensive Care …, 2023 - Springer
Purpose Venoarterial extracorporeal membrane oxygenation (VA-ECMO) is a complex and
high-risk life support modality used in severe cardiorespiratory failure. ECMO survival scores …

Multiple imputation via generative adversarial network for high-dimensional blockwise missing value problems

Z Dai, Z Bu, Q Long - 2021 20th IEEE International Conference …, 2021 - ieeexplore.ieee.org
Missing data are present in most real world problems and need careful handling to preserve
the prediction accuracy and statistical consistency in the downstream analysis. As the gold …

A comparative study of methods for estimating conditional Shapley values and when to use them

LHB Olsen, IK Glad, M Jullum, K Aas - arxiv preprint arxiv:2305.09536, 2023 - arxiv.org
Shapley values originated in cooperative game theory but are extensively used today as a
model-agnostic explanation framework to explain predictions made by complex machine …

The missing indicator method: From low to high dimensions

M Van Ness, TM Bosschieter… - Proceedings of the 29th …, 2023 - dl.acm.org
Missing data is common in applied data science, particularly for tabular data sets found in
healthcare, social sciences, and natural sciences. Most supervised learning methods only …

A diabetes prediction system based on incomplete fused data sources

Z Yuan, H Ding, G Chao, M Song, L Wang… - Machine Learning and …, 2023 - mdpi.com
In recent years, the diabetes population has grown younger. Therefore, it has become a key
problem to make a timely and effective prediction of diabetes, especially given a single data …

Online learning in variable feature spaces with mixed data

Y He, J Dong, BJ Hou, Y Wang… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
This paper explores a new online learning problem where the data streams are generated
from an over-time varying feature space, in which the random variables are of mixed data …

Sparse data reconstruction, missing value and multiple imputation through matrix factorization

N Sengupta, M Udell, N Srebro… - Sociological …, 2023 - journals.sagepub.com
Social science approaches to missing values predict avoided, unrequested, or lost
information from dense data sets, typically surveys. The authors propose a matrix …