Systematic review of advanced AI methods for improving healthcare data quality in post COVID-19 Era
At the beginning of the COVID-19 pandemic, there was significant hype about the potential
impact of artificial intelligence (AI) tools in combatting COVID-19 on diagnosis, prognosis, or …
impact of artificial intelligence (AI) tools in combatting COVID-19 on diagnosis, prognosis, or …
[HTML][HTML] Transformative strategies in photocatalyst design: merging computational methods and deep learning
J Liu, L Liang, B Su, D Wu, Y Zhang… - Journal of Materials …, 2024 - oaepublish.com
Photocatalysis is a unique technology that harnesses solar energy through in-situ
processes, operating without the need for external energy inputs. It is integral to advancing …
processes, operating without the need for external energy inputs. It is integral to advancing …
Goodcore: Data-effective and data-efficient machine learning through coreset selection over incomplete data
Given a dataset with incomplete data (eg, missing values), training a machine learning
model over the incomplete data requires two steps. First, it requires a data-effective step that …
model over the incomplete data requires two steps. First, it requires a data-effective step that …
An integrated network architecture for data repair and degradation trend prediction
Q Yang, B Tang, S Yang, Y Shen - Mechanical Systems and Signal …, 2023 - Elsevier
This paper proposed a network framework, namely DR-DTPN, which integrates data repair
and degradation trend prediction to address the serious deviation of equipment degradation …
and degradation trend prediction to address the serious deviation of equipment degradation …
Parker: Data fusion through consistent repairs using edit rules under partial keys
Data integration is the problem of consolidating information provided by multiple sources.
After schema map** and duplicate detection have been dealt with, the problem consists in …
After schema map** and duplicate detection have been dealt with, the problem consists in …
Data cleaning and machine learning: a systematic literature review
Abstract Machine Learning (ML) is integrated into a growing number of systems for various
applications. Because the performance of an ML model is highly dependent on the quality of …
applications. Because the performance of an ML model is highly dependent on the quality of …
RLclean: An unsupervised integrated data cleaning framework based on deep reinforcement learning
Data cleaning, a prerequisite to subsequent data analysis, has always been the focus of
data science research. Datasets with errors can severely detract from the quality of …
data science research. Datasets with errors can severely detract from the quality of …
Lincqa: Faster consistent query answering with linear time guarantees
Most data analytical pipelines often encounter the problem of querying inconsistent data that
violate pre-determined integrity constraints. Data cleaning is an extensively studied …
violate pre-determined integrity constraints. Data cleaning is an extensively studied …
Automatic data repair: Are we ready to deploy?
Data quality is paramount in today's data-driven world, especially in the era of generative AI.
Dirty data with errors and inconsistencies usually leads to flawed insights, unreliable …
Dirty data with errors and inconsistencies usually leads to flawed insights, unreliable …
[HTML][HTML] Natural generative noise diffusion model imputation
Imputation is a critical method for enhancing dataset quality, essential for ensuring accurate
analysis and insights. This research proposes an advanced imputation algorithm utilizing a …
analysis and insights. This research proposes an advanced imputation algorithm utilizing a …