[HTML][HTML] Data augmentation approaches in natural language processing: A survey
As an effective strategy, data augmentation (DA) alleviates data scarcity scenarios where
deep learning techniques may fail. It is widely applied in computer vision then introduced to …
deep learning techniques may fail. It is widely applied in computer vision then introduced to …
A survey on data augmentation for text classification
Data augmentation, the artificial creation of training data for machine learning by
transformations, is a widely studied research field across machine learning disciplines …
transformations, is a widely studied research field across machine learning disciplines …
A survey of data augmentation approaches for NLP
Data augmentation has recently seen increased interest in NLP due to more work in low-
resource domains, new tasks, and the popularity of large-scale neural networks that require …
resource domains, new tasks, and the popularity of large-scale neural networks that require …
Neural machine translation for low-resource languages: A survey
Neural Machine Translation (NMT) has seen tremendous growth in the last ten years since
the early 2000s and has already entered a mature phase. While considered the most widely …
the early 2000s and has already entered a mature phase. While considered the most widely …
Findings of the 2019 conference on machine translation (WMT19)
This paper presents the results of the premier shared task organized alongside the
Conference on Machine Translation (WMT) 2019. Participants were asked to build machine …
Conference on Machine Translation (WMT) 2019. Participants were asked to build machine …
An empirical survey of data augmentation for limited data learning in nlp
NLP has achieved great progress in the past decade through the use of neural models and
large labeled datasets. The dependence on abundant data prevents NLP models from being …
large labeled datasets. The dependence on abundant data prevents NLP models from being …
An analysis of simple data augmentation for named entity recognition
Simple yet effective data augmentation techniques have been proposed for sentence-level
and sentence-pair natural language processing tasks. Inspired by these efforts, we design …
and sentence-pair natural language processing tasks. Inspired by these efforts, we design …
Sequence-level mixed sample data augmentation
Despite their empirical success, neural networks still have difficulty capturing compositional
aspects of natural language. This work proposes a simple data augmentation approach to …
aspects of natural language. This work proposes a simple data augmentation approach to …
Gradient imitation reinforcement learning for low resource relation extraction
Low-resource Relation Extraction (LRE) aims to extract relation facts from limited labeled
corpora when human annotation is scarce. Existing works either utilize self-training scheme …
corpora when human annotation is scarce. Existing works either utilize self-training scheme …
Learning to generalize to more: Continuous semantic augmentation for neural machine translation
The principal task in supervised neural machine translation (NMT) is to learn to generate
target sentences conditioned on the source inputs from a set of parallel sentence pairs, and …
target sentences conditioned on the source inputs from a set of parallel sentence pairs, and …