A comparative survey of instance selection methods applied to non-neural and transformer-based text classification

W Cunha, F Viegas, C França, T Rosa, L Rocha… - ACM Computing …, 2023 - dl.acm.org
Progress in natural language processing has been dictated by the rule of more: more data,
more computing power, more complexity, best exemplified by deep learning Transformers …

On the cost-effectiveness of neural and non-neural approaches and representations for text classification: A comprehensive comparative study

W Cunha, V Mangaravite, C Gomes, S Canuto… - Information Processing …, 2021 - Elsevier
This article brings two major contributions. First, we present the results of a critical analysis
of recent scientific articles about neural and non-neural approaches and representations for …

An effective, efficient, and scalable confidence-based instance selection framework for transformer-based text classification

W Cunha, C França, G Fonseca, L Rocha… - Proceedings of the 46th …, 2023 - dl.acm.org
Transformer-based deep learning is currently the state-of-the-art in many NLP and IR tasks.
However, fine-tuning such Transformers for specific tasks, especially in scenarios of ever …

Improving the performance of sentiment analysis using enhanced preprocessing technique and artificial neural network

A Thakkar, D Mungra, A Agrawal… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
With the presence of a massive amount of digitally recorded data, an automated
computation can be preferable over the manual approach to evaluate sentiments within …

On the class separability of contextual embeddings representations–or “the classifier does not matter when the (text) representation is so good!”

CMV de Andrade, FM Belem, W Cunha… - Information Processing …, 2023 - Elsevier
The literature has not fully and adequately explained why contextual (eg, BERT-based)
representations are so successful to improve the effectiveness of some Natural Language …

[HTML][HTML] Public's mental health monitoring via sentimental analysis of financial text using machine learning techniques

SA Alanazi, A Khaliq, F Ahmad, N Alshammari… - International Journal of …, 2022 - mdpi.com
Public feelings and reactions associated with finance are gaining significant importance as
they help individuals, public health, financial and non-financial institutions, and the …

[HTML][HTML] CDFRS: A scalable sampling approach for efficient big data analysis

Y Cai, D Wu, X Sun, S Wu, J Xu, JZ Huang - Information Processing & …, 2024 - Elsevier
The sampling-based approximation method has demonstrated its potential in various
domains such as machine learning, query processing, and data analysis. Most preceding …

On representation learning-based methods for effective, efficient, and scalable code retrieval

C França, RC Lima, C Andrade, W Cunha… - Neurocomputing, 2024 - Elsevier
Code retrieval consists of finding relevant code snippets regarding a programmer's query—
an increasingly important task due to software ubiquity. Although significant progress has …

A Noise-Oriented and Redundancy-Aware Instance Selection Framework

W Cunha, A Moreo Fernández, A Esuli… - ACM Transactions on …, 2025 - dl.acm.org
Fine-tuning transformer-based deep-learning models are currently at the forefront of natural
language processing (NLP) and information retrieval (IR) tasks. However, fine-tuning these …