A holistic approach to undesired content detection in the real world
We present a holistic approach to building a robust and useful natural language
classification system for real-world content moderation. The success of such a system relies …
classification system for real-world content moderation. The success of such a system relies …
A survey of active learning for natural language processing
In this work, we provide a survey of active learning (AL) for its applications in natural
language processing (NLP). In addition to a fine-grained categorization of query strategies …
language processing (NLP). In addition to a fine-grained categorization of query strategies …
A Survey on Deep Active Learning: Recent Advances and New Frontiers
Active learning seeks to achieve strong performance with fewer training samples. It does this
by iteratively asking an oracle to label newly selected samples in a human-in-the-loop …
by iteratively asking an oracle to label newly selected samples in a human-in-the-loop …
Cache & distil: Optimising API calls to large language models
Large-scale deployment of generative AI tools often depends on costly API calls to a Large
Language Model (LLM) to fulfil user queries. To curtail the frequency of these calls, one can …
Language Model (LLM) to fulfil user queries. To curtail the frequency of these calls, one can …
Active learning for natural language generation
The field of Natural Language Generation (NLG) suffers from a severe shortage of labeled
data due to the extremely expensive and time-consuming process involved in manual …
data due to the extremely expensive and time-consuming process involved in manual …
Phrase-level active learning for neural machine translation
Neural machine translation (NMT) is sensitive to domain shift. In this paper, we address this
problem in an active learning setting where we can spend a given budget on translating in …
problem in an active learning setting where we can spend a given budget on translating in …
Turn-Level Active Learning for Dialogue State Tracking
Dialogue state tracking (DST) plays an important role in task-oriented dialogue systems.
However, collecting a large amount of turn-by-turn annotated dialogue data is costly and …
However, collecting a large amount of turn-by-turn annotated dialogue data is costly and …
Bayesian active learning with pretrained language models
Active Learning (AL) is a method to iteratively select data for annotation from a pool of
unlabeled data, aiming to achieve better model performance than random selection …
unlabeled data, aiming to achieve better model performance than random selection …
Active learning for neural machine translation
N Vashistha, K Singh, R Shakya - arxiv preprint arxiv:2301.00688, 2022 - arxiv.org
The machine translation mechanism translates texts automatically between different natural
languages, and Neural Machine Translation (NMT) has gained attention for its rational …
languages, and Neural Machine Translation (NMT) has gained attention for its rational …
[PDF][PDF] CHIA: CHoosing instances to annotate for machine translation
Neural machine translation (MT) systems have been shown to perform poorly on low-
resource language pairs, for which large-scale parallel data is unavailable. Making the data …
resource language pairs, for which large-scale parallel data is unavailable. Making the data …