A systematic review on imbalanced data challenges in machine learning: Applications and solutions

H Kaur, HS Pannu, AK Malhi - ACM computing surveys (CSUR), 2019 - dl.acm.org
In machine learning, the data imbalance imposes challenges to perform data analytics in
almost all areas of real-world research. The raw primary data often suffers from the skewed …

Learning from class-imbalanced data: Review of methods and applications

G Haixiang, L Yi**g, J Shang, G Mingyun… - Expert systems with …, 2017 - Elsevier
Rare events, especially those that could potentially negatively impact society, often require
humans' decision-making responses. Detecting rare events can be viewed as a prediction …

IGAN-IDS: An imbalanced generative adversarial network towards intrusion detection system in ad-hoc networks

S Huang, K Lei - Ad Hoc Networks, 2020 - Elsevier
With the emergence of ever-advancing network threats, the guarantee of system security
becomes increasingly crucial, especially in the dynamic and decentralized ad-hoc networks …

Attribute selection and imbalanced data: Problems in software defect prediction

TM Khoshgoftaar, K Gao… - 2010 22nd IEEE …, 2010 - ieeexplore.ieee.org
The data mining and machine learning community is often faced with two key problems:
working with imbalanced data and selecting the best features for machine learning. This …

Grouped SMOTE with noise filtering mechanism for classifying imbalanced data

K Cheng, C Zhang, H Yu, X Yang, H Zou, S Gao - IEEE Access, 2019 - ieeexplore.ieee.org
SMOTE (Synthetic Minority Oversampling TEchnique) is one of the most popular and well-
known sampling algorithms for addressing class imbalance learning problem. The merits of …

PWG-IDS: an intrusion detection model for solving class imbalance in IIoT networks using generative adversarial networks

L Zhang, S Jiang, X Shen, BB Gupta, Z Tian - arxiv preprint arxiv …, 2021 - arxiv.org
With the continuous development of industrial IoT (IIoT) technology, network security is
becoming more and more important. And intrusion detection is an important part of its …

Nature-inspired techniques in the context of fraud detection

M Behdad, L Barone, M Bennamoun… - IEEE Transactions on …, 2012 - ieeexplore.ieee.org
Electronic fraud is highly lucrative, with estimates suggesting these crimes to be worth
millions of dollars annually. Because of its complex nature, electronic fraud detection is …

LW-ELM: A fast and flexible cost-sensitive learning framework for classifying imbalanced data

H Yu, C Sun, X Yang, S Zheng, Q Wang, X ** - IEEE Access, 2018 - ieeexplore.ieee.org
Learning from imbalanced data is a challenging task in the fields of machine learning and
data mining. As an effective and efficient solution, cost-sensitive learning has been widely …

Exploring discrepancies in findings obtained with the KDD Cup'99 data set

V Engen, J Vincent, K Phalp - Intelligent Data Analysis, 2011 - content.iospress.com
The KDD Cup'99 data set has been widely used to evaluate intrusion detection prototypes,
most based on machine learning techniques, for nearly a decade. The data set served well …

Joint Sample Position Based Noise Filtering and Mean Shift Clustering for Imbalanced Classification Learning

L Duan, W Xue, J Huang… - Tsinghua Science and …, 2023 - ieeexplore.ieee.org
The problem of imbalanced data classification learning has received much attention.
Conventional classification algorithms are susceptible to data skew to favor majority …