Missing value imputation: a review and analysis of the literature (2006–2017)

WC Lin, CF Tsai - Artificial Intelligence Review, 2020 - Springer
Missing value imputation (MVI) has been studied for several decades being the basic
solution method for incomplete dataset problems, specifically those where some data …

[HTML][HTML] Missing value imputation affects the performance of machine learning: A review and analysis of the literature (2010–2021)

MK Hasan, MA Alam, S Roy, A Dutta, MT Jawad… - Informatics in Medicine …, 2021 - Elsevier
Recently, numerous studies have been conducted on Missing Value Imputation (MVI),
intending the primary solution scheme for the datasets containing one or more missing …

Random forest missing data algorithms

F Tang, H Ishwaran - Statistical Analysis and Data Mining: The …, 2017 - Wiley Online Library
Random forest (RF) missing data algorithms are an attractive approach for imputing missing
data. They have the desirable properties of being able to handle mixed types of missing …

Nearest neighbor imputation algorithms: a critical evaluation

L Beretta, A Santaniello - BMC medical informatics and decision making, 2016 - Springer
Background Nearest neighbor (NN) imputation algorithms are efficient methods to fill in
missing data where each missing value on some records is replaced by a value obtained …

Application of optimized machine learning techniques for prediction of occupational accidents

S Sarkar, S Vinay, R Raj, J Maiti, P Mitra - Computers & Operations …, 2019 - Elsevier
Although, the usefulness of the machine learning (ML) technique in predicting future
outcomes has been established in different domains of applications (eg, heath care), its …

Artificial intelligence (AI) and machine learning (ML) in precision oncology: a review on enhancing discoverability through multiomics integration

L Wei, D Niraula, EDH Gates, J Fu, Y Luo… - The British Journal of …, 2023 - academic.oup.com
Multiomics data including imaging radiomics and various types of molecular biomarkers
have been increasingly investigated for better diagnosis and therapy in the era of precision …

Multiple imputation using nearest neighbor methods

S Faisal, G Tutz - Information Sciences, 2021 - Elsevier
Missing values are a major problem in medical research. As the complete case analysis
discards useful information, estimation and inference may suffer strongly. Multiple imputation …

Handling complex missing data using random forest approach for an air quality monitoring dataset: a case study of Kuwait environmental data (2012 to 2018)

AR Alsaber, J Pan, A Al-Hurban - International Journal of Environmental …, 2021 - mdpi.com
In environmental research, missing data are often a challenge for statistical modeling. This
paper addressed some advanced techniques to deal with missing values in a data set …

Consumption of coffee and tea with all-cause and cause-specific mortality: a prospective cohort study

Y Chen, Y Zhang, M Zhang, H Yang, Y Wang - BMC medicine, 2022 - Springer
Background Previous studies suggested that moderate coffee and tea consumption are
associated with lower risk of mortality. However, the association between the combination of …

Multiple imputation for general missing data patterns in the presence of high-dimensional data

Y Deng, C Chang, MS Ido, Q Long - Scientific reports, 2016 - nature.com
Multiple imputation (MI) has been widely used for handling missing data in biomedical
research. In the presence of high-dimensional data, regularized regression has been used …