Filter feature selection methods for text classification: a review

H Ming, W Heyong - Multimedia Tools and Applications, 2024 - Springer
Filter feature selection methods are utilized to select discriminative terms from high-
dimensional text data to improve text classification performance and reduce computational …

Predicting and analyzing injury severity: A machine learning-based approach using class-imbalanced proactive and reactive data

S Sarkar, A Pramanik, J Maiti, G Reniers - Safety science, 2020 - Elsevier
Although the utility of the machine learning (ML) techniques is established in occupational
accident domain using reactive data, its exploration in predicting injury severity using both …

Fine-tuned generative LLM oversampling can improve performance over traditional techniques on multiclass imbalanced text classification

NA Cloutier, N Japkowicz - 2023 IEEE International conference …, 2023 - ieeexplore.ieee.org
A common challenge in classification and data mining is the class imbalance problem,
where one class in a dataset has significantly more samples than another. The presence of …

The impact of deep learning on document classification using semantically rich representations

Z Kastrati, AS Imran, SY Yayilgan - Information Processing & Management, 2019 - Elsevier
This paper presents a semantically rich document representation model for automatically
classifying financial documents into predefined categories utilizing deep learning. The …

Supervised Hebb rule based feature selection for text classification

H Wang, M Hong - Information Processing & Management, 2019 - Elsevier
Text documents usually contain high dimensional non-discriminative (irrelevant and noisy)
terms which lead to steep computational costs and poor learning performance of text …

Improved recurrent neural networks for text classification and dynamic Sylvester equation solving

W Chen, J **, D Gerontitis, L Qiu, J Zhu - Neural Processing Letters, 2023 - Springer
The solution of the text classification and time-varying problems are two basic practical
problems frequently encountered in the fields of science and engineering, and most of the …

[HTML][HTML] Improving ontology-based text classification: An occupational health and security application

N Sanchez-Pi, L Martí, ACB Garcia - Journal of Applied Logic, 2016 - Elsevier
Abstract Information retrieval has been widely studied due to the growing amounts of textual
information available electronically. Nowadays organizations and industries are facing the …

Safer: a robust and efficient framework for fine-tuning bert-based classifier with noisy labels

Z Qi, X Tan, C Qu, Y Xu, Y Qi - … of the 61st Annual Meeting of the …, 2023 - aclanthology.org
Learning on noisy datasets is a challenging problem when pre-trained language models are
applied to real-world text classification tasks. In numerous industrial applications, acquiring …

Classification and pattern extraction of incidents: A deep learning-based approach

S Sarkar, S Vinay, C Djeddi, J Maiti - Neural Computing and Applications, 2022 - Springer
Classifying or predicting occupational incidents using both structured and unstructured (text)
data are an unexplored area of research. Unstructured texts, ie, incident narratives are often …

A method for optimizing text preprocessing and text classification using multiple cycles of learning with an application on shipbrokers emails

G Papageorgiou, P Economou… - Journal of Applied …, 2024 - Taylor & Francis
Optimizing text preprocessing and text classification algorithms is an important, everyday
task in large organizations and companies and it usually involves a labor-intensive and time …