Anonymization techniques for privacy preserving data publishing: A comprehensive survey

A Majeed, S Lee - IEEE access, 2020 - ieeexplore.ieee.org
Anonymization is a practical solution for preserving user's privacy in data publishing. Data
owners such as hospitals, banks, social network (SN) service providers, and insurance …

[HTML][HTML] Survey: Time-series data preprocessing: A survey and an empirical analysis

A Tawakuli, B Havers, V Gulisano, D Kaiser… - Journal of Engineering …, 2024 - Elsevier
Data are naturally collected in their raw state and must undergo a series of preprocessing
steps to obtain data in their input state for Artificial Intelligence (AI) and other applications …

Time series data cleaning: A survey

X Wang, C Wang - Ieee Access, 2019 - ieeexplore.ieee.org
Errors are prevalent in time series data, which is particularly common in the industrial field.
Data with errors could not be stored in the database, which results in the loss of data assets …

[PDF][PDF] A Review of Data Cleaning Methods for Web Information System.

J Wang, X Wang, Y Yang… - … , Materials & Continua, 2020 - pdfs.semanticscholar.org
Web information system (WIS) is frequently-used and indispensable in daily social life. WIS
provides information services in many scenarios, such as electronic commerce …

[HTML][HTML] cleanTS: Automated (AutoML) tool to clean univariate time series at microscales

MK Shende, AE Feijoo-Lorenzo, ND Bokde - Neurocomputing, 2022 - Elsevier
Data cleaning is one of the most important tasks in data analysis processes. One of the
perennial challenges in data analytics is the detection and handling of non-valid data …

Entity matching with auc-based fairness

S Nilforoushan, Q Wu, M Milani - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
The research on fair machine learning (ML) has been growing due to the high demand for
unbiased and fair ML models for objective decision-making. Most of this research has been …

Privacy-aware data cleaning-as-a-service

Y Huang, M Milani, F Chiang - Information Systems, 2020 - Elsevier
Data cleaning is a pervasive problem for organizations as they try to reap value from their
data. Recent advances in networking and cloud computing technology have fueled a new …

Contextual data cleaning with ontology functional dependencies

Z Zheng, L Zheng, M Alipourlangouri… - ACM Journal of Data …, 2022 - dl.acm.org
Functional Dependencies define attribute relationships based on syntactic equality, and
when used in data cleaning, they erroneously label syntactically different but semantically …

Leveraging currency for repairing inconsistent and incomplete data

X Ding, H Wang, J Su, M Wang, J Li… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Data quality plays a key role in big data management today. With the explosive growth of
data from a variety of sources, the quality of data is faced with multiple problems. Motivated …

Differentially Private k-Nearest Neighbor Missing Data Imputation

C Clifton, EJ Hanson, K Merrill, S Merrill - ACM transactions on privacy …, 2022 - dl.acm.org
Using techniques employing smooth sensitivity, we develop a method for-nearest neighbor
missing data imputation with differential privacy. This requires bounding the number of data …