A survey on missing data in machine learning

T Emmanuel, T Maupong, D Mpoeleng, T Semong… - Journal of Big …, 2021 - Springer
Abstract Machine learning has been the corner stone in analysing and extracting information
from data and often a problem of missing values is encountered. Missing values occur …

Missing value imputation: a review and analysis of the literature (2006–2017)

WC Lin, CF Tsai - Artificial Intelligence Review, 2020 - Springer
Missing value imputation (MVI) has been studied for several decades being the basic
solution method for incomplete dataset problems, specifically those where some data …

Efficient kNN classification with different numbers of nearest neighbors

S Zhang, X Li, M Zong, X Zhu… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
k nearest neighbor (kNN) method is a popular classification method in data mining and
statistics because of its simple implementation and significant classification performance …

Learning k for kNN Classification

S Zhang, X Li, M Zong, X Zhu, D Cheng - ACM Transactions on …, 2017 - dl.acm.org
The K Nearest Neighbor (kNN) method has widely been used in the applications of data
mining and machine learning due to its simple implementation and distinguished …

Human digital twin for fitness management

BR Barricelli, E Casiraghi, J Gliozzo, A Petrini… - Ieee …, 2020 - ieeexplore.ieee.org
Our research work describes a team of human Digital Twins (DTs), each tracking fitness-
related measurements describing an athlete's behavior in consecutive days (eg food …

A novel kNN algorithm with data-driven k parameter computation

S Zhang, D Cheng, Z Deng, M Zong, X Deng - Pattern Recognition Letters, 2018 - Elsevier
This paper studies an example-driven k-parameter computation that identifies different k
values for different test samples in kNN prediction applications, such as classification …

Nearest neighbor selection for iteratively kNN imputation

S Zhang - Journal of Systems and Software, 2012 - Elsevier
Existing kNN imputation methods for dealing with missing data are designed according to
Minkowski distance or its variants, and have been shown to be generally efficient for …

Missing value estimation for mixed-attribute data sets

X Zhu, S Zhang, Z **, Z Zhang… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
Missing data imputation is a key issue in learning from incomplete data. Various techniques
have been developed with great successes on dealing with missing values in data sets with …

Hybrid prediction model with missing value imputation for medical data

A Purwar, SK Singh - Expert Systems with Applications, 2015 - Elsevier
Accurate prediction in the presence of large number of missing values in the data set has
always been a challenging problem. Most of hybrid models to address this challenge have …

Graph self-representation method for unsupervised feature selection

R Hu, X Zhu, D Cheng, W He, Y Yan, J Song, S Zhang - Neurocomputing, 2017 - Elsevier
Both subspace learning methods and feature selection methods are often used for removing
irrelative features from high-dimensional data. Studies have shown that feature selection …