The emerging trends of multi-label learning
Exabytes of data are generated daily by humans, leading to the growing needs for new
efforts in dealing with the grand challenges for multi-label learning brought by big data. For …
efforts in dealing with the grand challenges for multi-label learning brought by big data. For …
Pre-training tasks for embedding-based large-scale retrieval
We consider the large-scale query-document retrieval problem: given a query (eg, a
question), return the set of relevant documents (eg, paragraphs containing the answer) from …
question), return the set of relevant documents (eg, paragraphs containing the answer) from …
Fast multi-resolution transformer fine-tuning for extreme multi-label text classification
Extreme multi-label text classification~(XMC) seeks to find relevant labels from an extreme
large label collection for a given text input. Many real-world applications can be formulated …
large label collection for a given text input. Many real-world applications can be formulated …
Taming pretrained transformers for extreme multi-label text classification
We consider the extreme multi-label text classification (XMC) problem: given an input text,
return the most relevant labels from a large label collection. For example, the input text could …
return the most relevant labels from a large label collection. For example, the input text could …
Matryoshka representation learning
Learned representations are a central component in modern ML systems, serving a
multitude of downstream tasks. When training such representations, it is often the case that …
multitude of downstream tasks. When training such representations, it is often the case that …
Deepxml: A deep extreme multi-label learning framework applied to short text documents
Scalability and accuracy are well recognized challenges in deep extreme multi-label
learning where the objective is to train architectures for automatically annotating a data point …
learning where the objective is to train architectures for automatically annotating a data point …
Decaf: Deep extreme classification with label features
Extreme multi-label classification (XML) involves tagging a data point with its most relevant
subset of labels from an extremely large label set, with several applications such as product …
subset of labels from an extremely large label set, with several applications such as product …
Pecos: Prediction for enormous and correlated output spaces
Many large-scale applications amount to finding relevant results from an enormous output
space of potential candidates. For example, finding the best matching product from a large …
space of potential candidates. For example, finding the best matching product from a large …
XRR: Extreme multi-label text classification with candidate retrieving and deep ranking
Abstract Extreme Multi-label Text Classification (XMTC) is a key task of finding the most
relevant labels from a large label set for a document. Although some deep learning-based …
relevant labels from a large label set for a document. Although some deep learning-based …
Towards class-imbalance aware multi-label learning
Multi-label learning deals with training examples each represented by a single instance
while associated with multiple class labels. Due to the exponential number of possible label …
while associated with multiple class labels. Due to the exponential number of possible label …