Active learning query strategies for classification, regression, and clustering: A survey

P Kumar, A Gupta - Journal of Computer Science and Technology, 2020 - Springer
Generally, data is available abundantly in unlabeled form, and its annotation requires some
cost. The labeling, as well as learning cost, can be minimized by learning with the minimum …

Dealing with class imbalance in classifier chains via random undersampling

B Liu, G Tsoumakas - Knowledge-Based Systems, 2020 - Elsevier
Class imbalance is an intrinsic characteristic of multi-label data. Most of the labels in multi-
label data sets are associated with a small number of training examples, much smaller …

A benchmark and comparison of active learning for logistic regression

Y Yang, M Loog - Pattern Recognition, 2018 - Elsevier
Logistic regression is by far the most widely used classifier in real-world applications. In this
paper, we benchmark the state-of-the-art active learning methods for logistic regression and …

Active learning for hierarchical multi-label classification

FK Nakano, R Cerri, C Vens - Data Mining and Knowledge Discovery, 2020 - Springer
Due to technological advances, a massive amount of data is produced daily, presenting
challenges for application areas where data needs to be labelled by a domain specialist or …

On the classification of software change messages using multi-label active learning

S Gharbi, MW Mkaouer, I Jenhani… - Proceedings of the 34th …, 2019 - dl.acm.org
In this paper, we present a multi-label active learning-based approach to handle the
problem of classification of commit messages. The approach will help developers track …

Granular multilabel batch active learning with pairwise label correlation

Y Zhang, T Zhao, D Miao… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Abundant data with limited labeling are a widespread bottleneck in multilabel learning.
Active learning (AL) is an effective solution to gradually enhance model robustness …

Stable matching-based two-way selection in multi-label active learning with imbalanced data

S Chen, R Wang, J Lu, X Wang - Information Sciences, 2022 - Elsevier
Multi-label active learning (MLAL) reduces the cost of manual annotation for multi-label
problems by selecting high-quality unlabeled data. Existing MLAL methods usually perform …

CoMAL: Contrastive Active Learning for Multi-Label Text Classification

C Peng, H Wang, K Chen, L Shou, C Yao… - Proceedings of the 30th …, 2024 - dl.acm.org
Multi-label text classification (MLTC) allows a given text to be associated with multiple
labels, which well suits many real-world data mining scenarios. However, the annotation …

Extending version-space theory to multi-label active learning with imbalanced data

R Wang, S Chen, Y Yu - Pattern Recognition, 2023 - Elsevier
Abstract Version space, defined as the subset of the hypothesis space consistent with the
training samples, is an important concept in supervised learning. It has been successfully …

Deep active learning for multi label text classification

Q Wang, H Zhang, W Zhang, L Dai, Y Liang, H Shi - Scientific Reports, 2024 - nature.com
Given a set of labels, multi-label text classification (MLTC) aims to assign multiple relevant
labels for a text. Recently, deep learning models get inspiring results in MLTC. Training a …