Real-world data: a brief review of the methods, applications, challenges and opportunities

F Liu, D Panagiotakos - BMC Medical Research Methodology, 2022‏ - Springer
Background The increased adoption of the internet, social media, wearable devices, e-
health services, and other technology-driven services in medicine and healthcare has led to …

Metaheuristic design of feedforward neural networks: A review of two decades of research

VK Ojha, A Abraham, V Snášel - Engineering Applications of Artificial …, 2017‏ - Elsevier
Over the past two decades, the feedforward neural network (FNN) optimization has been a
key interest among the researchers and practitioners of multiple disciplines. The FNN …

Comparison of text preprocessing methods

CP Chai - Natural Language Engineering, 2023‏ - cambridge.org
Text preprocessing is not only an essential step to prepare the corpus for modeling but also
a key area that directly affects the natural language processing (NLP) application results. For …

[كتاب][B] Data cleaning

IF Ilyas, X Chu - 2019‏ - books.google.com
This is an overview of the end-to-end data cleaning process. Data quality is one of the most
important problems in data management, since dirty data often leads to inaccurate data …

Artificial intelligence and big data in public health

K Benke, G Benke - International journal of environmental research and …, 2018‏ - mdpi.com
Artificial intelligence and automation are topics dominating global discussions on the future
of professional employment, societal change, and economic performance. In this paper, we …

Detecting and preventing cyber insider threats: A survey

L Liu, O De Vel, QL Han, J Zhang… - … Surveys & Tutorials, 2018‏ - ieeexplore.ieee.org
Information communications technology systems are facing an increasing number of cyber
security threats, the majority of which are originated by insiders. As insiders reside behind …

An empirical study on tensorflow program bugs

Y Zhang, Y Chen, SC Cheung, Y **ong… - Proceedings of the 27th …, 2018‏ - dl.acm.org
Deep learning applications become increasingly popular in important domains such as self-
driving systems and facial identity systems. Defective deep learning applications may lead to …

Data and information quality

C Batini, M Scannapieco - Cham, Switzerland: Springer International …, 2016‏ - Springer
This book is the result of a study path that started in 2006, when the two authors of this book
published the book Data Quality: Concepts, Methodologies and Techniques. After 8 years …

Blocking and filtering techniques for entity resolution: A survey

G Papadakis, D Skoutas, E Thanos… - ACM Computing Surveys …, 2020‏ - dl.acm.org
Entity Resolution (ER), a core task of Data Integration, detects different entity profiles that
correspond to the same real-world object. Due to its inherently quadratic complexity, a series …

Transforming big data into smart data: An insight on the use of the k‐nearest neighbors algorithm to obtain quality data

I Triguero, D García‐Gil, J Maillo… - … : Data Mining and …, 2019‏ - Wiley Online Library
The k‐nearest neighbors algorithm is characterized as a simple yet effective data mining
technique. The main drawback of this technique appears when massive amounts of data …