Text data augmentation for deep learning
Abstract Natural Language Processing (NLP) is one of the most captivating applications of
Deep Learning. In this survey, we consider how the Data Augmentation training strategy can …
Deep Learning. In this survey, we consider how the Data Augmentation training strategy can …
Neural machine translation for low-resource languages: A survey
Neural Machine Translation (NMT) has seen tremendous growth in the last ten years since
the early 2000s and has already entered a mature phase. While considered the most widely …
the early 2000s and has already entered a mature phase. While considered the most widely …
Causal machine learning: A survey and open problems
Causal Machine Learning (CausalML) is an umbrella term for machine learning methods
that formalize the data-generation process as a structural causal model (SCM). This …
that formalize the data-generation process as a structural causal model (SCM). This …
LexSym: Compositionality as lexical symmetry
In tasks like semantic parsing, instruction following, and question answering, standard deep
networks fail to generalize compositionally from small datasets. Many existing approaches …
networks fail to generalize compositionally from small datasets. Many existing approaches …
Sequence-to-sequence learning with latent neural grammars
Y Kim - Advances in Neural Information Processing …, 2021 - proceedings.neurips.cc
Sequence-to-sequence learning with neural networks has become the de facto standard for
sequence modeling. This approach typically models the local distribution over the next …
sequence modeling. This approach typically models the local distribution over the next …
The robustness of counterfactual explanations over time
Counterfactual explanations are a prominent example of post-hoc interpretability methods in
the explainable Artificial Intelligence (AI) research domain. Differently from other explanation …
the explainable Artificial Intelligence (AI) research domain. Differently from other explanation …
CipherDAug: Ciphertext based data augmentation for neural machine translation
We propose a novel data-augmentation technique for neural machine translation based on
ROT-$ k $ ciphertexts. ROT-$ k $ is a simple letter substitution cipher that replaces a letter in …
ROT-$ k $ ciphertexts. ROT-$ k $ is a simple letter substitution cipher that replaces a letter in …
Improving conversational recommendation systems via counterfactual data simulation
Conversational recommender systems~(CRSs) aim to provide recommendation services via
natural language conversations. Although a number of approaches have been proposed for …
natural language conversations. Although a number of approaches have been proposed for …
Counterfactual data augmentation via perspective transition for open-domain dialogues
The construction of open-domain dialogue systems requires high-quality dialogue datasets.
The dialogue data admits a wide variety of responses for a given dialogue history, especially …
The dialogue data admits a wide variety of responses for a given dialogue history, especially …
Augmenting multi-turn text-to-SQL datasets with self-play
The task of context-dependent text-to-SQL aims to convert multi-turn user utterances to
formal SQL queries. This is a challenging task due to both the scarcity of training data from …
formal SQL queries. This is a challenging task due to both the scarcity of training data from …