A survey of methods for addressing class imbalance in deep-learning based natural language processing

S Henning, W Beluch, A Fraser, A Friedrich - arxiv preprint arxiv …, 2022‏ - arxiv.org
Many natural language processing (NLP) tasks are naturally imbalanced, as some target
categories occur much more frequently than others in the real world. In such scenarios …

Hierarchical graph-based integration network for propaganda detection in textual news articles on social media

PN Ahmad, J Guo, NM AboElenein, QM Haq… - Scientific Reports, 2025‏ - nature.com
During the Covid-19 pandemic, the widespread use of social media platforms has facilitated
the dissemination of information, fake news, and propaganda, serving as a vital source of …

NLRG at SemEval-2021 task 5: Toxic spans detection leveraging BERT-based token classification and span prediction techniques

G Chhablani, A Sharma, H Pandey, Y Bhartia… - arxiv preprint arxiv …, 2021‏ - arxiv.org
Toxicity detection of text has been a popular NLP task in the recent years. In SemEval-2021
Task-5 Toxic Spans Detection, the focus is on detecting toxic spans within passages. Most …

MemeMind at ArAIEval Shared Task: spotting persuasive spans in arabic text with persuasion techniques identification

MR Biswas, Z Shah, W Zaghouani - arxiv preprint arxiv:2408.04540, 2024‏ - arxiv.org
This paper focuses on detecting propagandistic spans and persuasion techniques in Arabic
text from tweets and news paragraphs. Each entry in the dataset contains a text sample and …

CLIMB: Imbalanced Data Modelling Using Contrastive Learning with Limited Labels

A Alsuhaibani, I Razzak, S Jameel, X Wang… - … Conference on Web …, 2024‏ - Springer
Abstract Machine learning classifiers typically rely on the assumption of balanced training
datasets, with sufficient examples per class to facilitate effective model learning. However …

Propaganda detection in Russian and American news coverage about the war in Ukraine through text classification

V Hein - 2023‏ - repositum.tuwien.at
During different decades, propaganda was a vital technique to manipulate human opinions
subtly. With the rise of mass media, subtle opinions can be incepted by various sources …

Majority or Minority: Data Imbalance Learning Method for Named Entity Recognition

S Nemoto, S Kitada, H Iyatomi - IEEE Access, 2024‏ - ieeexplore.ieee.org
Data imbalance presents a significant challenge in various machine learning (ML) tasks,
particularly named entity recognition (NER) within natural language processing (NLP). NER …

Propaganda detection in public Covid-19 discussion on social media

PN Ahmad, AM Shah, KY Lee - 2023‏ - aisel.aisnet.org
As social media has grown exponentially during Covid-19, they have helped disseminate
information, spread fake news and propaganda; thus provide a source of self-reported …