Special issue on feature engineering editorial

T Verdonck, B Baesens, M Óskarsdóttir… - Machine learning, 2024 - Springer
In order to improve the performance of any machine learning model, it is important to focus
more on the data itself instead of continuously developing new algorithms. This is exactly the …

Data engineering for fraud detection

B Baesens, S Höppner, T Verdonck - Decision Support Systems, 2021 - Elsevier
Financial institutions increasingly rely upon data-driven methods for developing fraud
detection systems, which are able to automatically detect and block fraudulent transactions …

Estimating the contamination factor's distribution in unsupervised anomaly detection

L Perini, PC Bürkner, A Klami - International Conference on …, 2023 - proceedings.mlr.press
Anomaly detection methods identify examples that do not follow the expected behaviour,
typically in an unsupervised fashion, by assigning real-valued anomaly scores to the …

An agile approach to identify single and hybrid normalization for enhancing machine learning-based network intrusion detection

MA Siddiqi, W Pak - IEEE Access, 2021 - ieeexplore.ieee.org
Detecting intrusion in network traffic has remained a problematic task for years. Progress in
the field of machine learning is paving the way for enhancing intrusion detection systems …

A robust bootstrap test for mediation analysis

A Alfons, NY Ateş, PJF Groenen - Organizational Research …, 2022 - journals.sagepub.com
Mediation analysis is central to theory building and testing in organizational sciences.
Scholars often use linear regression analysis based on normal-theory maximum likelihood …

Developing relative spatial poverty index using integrated remote sensing and geospatial big data approach: A case study of East Java, Indonesia

SR Putri, AW Wijayanto, AD Sakti - ISPRS International Journal of Geo …, 2022 - mdpi.com
Poverty data are usually collected through on-the-ground household-based socioeconomic
surveys. Unfortunately, data collection with such conventional methods is expensive …

Tier-based optimization for synthesized network intrusion detection system

MA Siddiqi, W Pak - IEEE Access, 2022 - ieeexplore.ieee.org
The innovation and evolution of hacking methodologies have led to a sharp rise in cyber
attacks, highlighting the need for enhanced network security approaches. Network intrusion …

Machine learning-based global air quality index development using remote sensing and ground-based stations

TS Anggraini, H Irie, AD Sakti, K Wikantika - Environmental Advances, 2024 - Elsevier
Air pollution refers to the presence of hazardous substances in the air that has adverse
effects on health, causing millions of premature deaths annually. Ground-based stations can …

Productivity enhancement by prediction of liquid steel breakout during continuous casting process in manufacturing of steel slabs in steel plant using artificial neural …

MO Ansari, S Chattopadhyaya, J Ghose, S Sharma… - Materials, 2022 - mdpi.com
Breakout is one of the major accidents that often arise in the continuous casting shops of
steel slabs in Bokaro Steel Plant, Jharkhand, India. Breakouts cause huge capital loss …

Challenges of cellwise outliers

J Raymaekers, PJ Rousseeuw - Econometrics and Statistics, 2024 - Elsevier
It is well-known that real data often contain outliers. The term outlier usually refers to a case,
usually denoted by a row of the n × d data matrix. In recent times a different type has come …