A survey on data augmentation for text classification
Data augmentation, the artificial creation of training data for machine learning by
transformations, is a widely studied research field across machine learning disciplines …
transformations, is a widely studied research field across machine learning disciplines …
Pre-trained language models for text generation: A survey
Text Generation aims to produce plausible and readable text in human language from input
data. The resurgence of deep learning has greatly advanced this field, in particular, with the …
data. The resurgence of deep learning has greatly advanced this field, in particular, with the …
A survey of data augmentation approaches for NLP
Data augmentation has recently seen increased interest in NLP due to more work in low-
resource domains, new tasks, and the popularity of large-scale neural networks that require …
resource domains, new tasks, and the popularity of large-scale neural networks that require …
A survey on recent approaches for natural language processing in low-resource scenarios
Deep neural networks and huge language models are becoming omnipresent in natural
language applications. As they are known for requiring large amounts of training data, there …
language applications. As they are known for requiring large amounts of training data, there …
AEDA: an easier data augmentation technique for text classification
This paper proposes AEDA (An Easier Data Augmentation) technique to help improve the
performance on text classification tasks. AEDA includes only random insertion of …
performance on text classification tasks. AEDA includes only random insertion of …
Promda: Prompt-based data augmentation for low-resource nlu tasks
This paper focuses on the Data Augmentation for low-resource Natural Language
Understanding (NLU) tasks. We propose Prompt-based D} ata Augmentation model …
Understanding (NLU) tasks. We propose Prompt-based D} ata Augmentation model …
MELM: Data augmentation with masked entity language modeling for low-resource NER
Data augmentation is an effective solution to data scarcity in low-resource scenarios.
However, when applied to token-level tasks such as NER, data augmentation methods often …
However, when applied to token-level tasks such as NER, data augmentation methods often …
MulDA: A multilingual data augmentation framework for low-resource cross-lingual NER
Abstract Named Entity Recognition (NER) for low-resource languages is a both practical and
challenging research problem. This paper addresses zero-shot transfer for cross-lingual …
challenging research problem. This paper addresses zero-shot transfer for cross-lingual …
A survey on arabic named entity recognition: Past, recent advances, and future trends
As more and more Arabic texts emerged on the Internet, extracting important information
from these Arabic texts is especially useful. As a fundamental technology, Named entity …
from these Arabic texts is especially useful. As a fundamental technology, Named entity …
Language-guided music recommendation for video via prompt analogies
We propose a method to recommend music for an input video while allowing a user to guide
music selection with free-form natural language. A key challenge of this problem setting is …
music selection with free-form natural language. A key challenge of this problem setting is …