[HTML][HTML] Text classification algorithms: A survey

K Kowsari, K Jafari Meimandi, M Heidarysafa, S Mendu… - Information, 2019 - mdpi.com
In recent years, there has been an exponential growth in the number of complex documents
and texts that require a deeper understanding of machine learning methods to be able to …

Survey on multi-output learning

D Xu, Y Shi, IW Tsang, YS Ong… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
The aim of multi-output learning is to simultaneously predict multiple outputs given an input.
It is an important learning problem for decision-making since making decisions in the real …

Large-scale multi-label text classification on EU legislation

I Chalkidis, M Fergadiotis, P Malakasiotis… - arxiv preprint arxiv …, 2019 - arxiv.org
We consider Large-Scale Multi-Label Text Classification (LMTC) in the legal domain. We
release a new dataset of 57k legislative documents from EURLEX, annotated with~ 4.3 k …

An analysis of hierarchical text classification using word embeddings

RA Stein, PA Jaques, JF Valiati - Information Sciences, 2019 - Elsevier
Efficient distributed numerical word representation models (word embeddings) combined
with modern machine learning algorithms have recently yielded considerable improvement …

Extreme multi-label loss functions for recommendation, tagging, ranking & other missing label applications

H Jain, Y Prabhu, M Varma - Proceedings of the 22nd ACM SIGKDD …, 2016 - dl.acm.org
The choice of the loss function is critical in extreme multi-label learning where the objective
is to annotate each data point with the most relevant subset of labels from an extremely large …

Taming pretrained transformers for extreme multi-label text classification

WC Chang, HF Yu, K Zhong, Y Yang… - Proceedings of the 26th …, 2020 - dl.acm.org
We consider the extreme multi-label text classification (XMC) problem: given an input text,
return the most relevant labels from a large label collection. For example, the input text could …

Dismec: Distributed sparse machines for extreme multi-label classification

R Babbar, B Schölkopf - Proceedings of the tenth ACM international …, 2017 - dl.acm.org
Extreme multi-label classification refers to supervised multi-label learning involving
hundreds of thousands or even millions of labels. Datasets in extreme classification exhibit …

Few-shot and zero-shot multi-label learning for structured label spaces

A Rios, R Kavuluru - … of the conference on empirical methods …, 2018 - pmc.ncbi.nlm.nih.gov
Large multi-label datasets contain labels that occur thousands of times (frequent group),
those that occur only a few times (few-shot group), and labels that never appear in the …

Bonsai: diverse and shallow trees for extreme multi-label classification

S Khandagale, H **ao, R Babbar - Machine Learning, 2020 - Springer
Extreme multi-label classification (XMC) refers to supervised multi-label learning involving
hundreds of thousands or even millions of labels. In this paper, we develop a suite of …

Pd-sparse: A primal and dual sparse approach to extreme multiclass and multilabel classification

IEH Yen, X Huang, P Ravikumar… - … on machine learning, 2016 - proceedings.mlr.press
Abstract We consider Multiclass and Multilabel classification with extremely large number of
classes, of which only few are labeled to each instance. In such setting, standard methods …