Text data augmentation for deep learning

C Shorten, TM Khoshgoftaar, B Furht - Journal of big Data, 2021 - Springer
Abstract Natural Language Processing (NLP) is one of the most captivating applications of
Deep Learning. In this survey, we consider how the Data Augmentation training strategy can …

Neural machine translation for low-resource languages: A survey

S Ranathunga, ESA Lee, M Prifti Skenduli… - ACM Computing …, 2023 - dl.acm.org
Neural Machine Translation (NMT) has seen tremendous growth in the last ten years since
the early 2000s and has already entered a mature phase. While considered the most widely …

Causal machine learning: A survey and open problems

J Kaddour, A Lynch, Q Liu, MJ Kusner… - arxiv preprint arxiv …, 2022 - arxiv.org
Causal Machine Learning (CausalML) is an umbrella term for machine learning methods
that formalize the data-generation process as a structural causal model (SCM). This …

LexSym: Compositionality as lexical symmetry

E Akyürek, J Andreas - Proceedings of the 61st Annual Meeting of …, 2023 - aclanthology.org
In tasks like semantic parsing, instruction following, and question answering, standard deep
networks fail to generalize compositionally from small datasets. Many existing approaches …

Sequence-to-sequence learning with latent neural grammars

Y Kim - Advances in Neural Information Processing …, 2021 - proceedings.neurips.cc
Sequence-to-sequence learning with neural networks has become the de facto standard for
sequence modeling. This approach typically models the local distribution over the next …

The robustness of counterfactual explanations over time

A Ferrario, M Loi - Ieee Access, 2022 - ieeexplore.ieee.org
Counterfactual explanations are a prominent example of post-hoc interpretability methods in
the explainable Artificial Intelligence (AI) research domain. Differently from other explanation …

CipherDAug: Ciphertext based data augmentation for neural machine translation

N Kambhatla, L Born, A Sarkar - arxiv preprint arxiv:2204.00665, 2022 - arxiv.org
We propose a novel data-augmentation technique for neural machine translation based on
ROT-$ k $ ciphertexts. ROT-$ k $ is a simple letter substitution cipher that replaces a letter in …

Improving conversational recommendation systems via counterfactual data simulation

X Wang, K Zhou, X Tang, WX Zhao, F Pan… - Proceedings of the 29th …, 2023 - dl.acm.org
Conversational recommender systems~(CRSs) aim to provide recommendation services via
natural language conversations. Although a number of approaches have been proposed for …

Counterfactual data augmentation via perspective transition for open-domain dialogues

J Ou, J Zhang, Y Feng, J Zhou - arxiv preprint arxiv:2210.16838, 2022 - arxiv.org
The construction of open-domain dialogue systems requires high-quality dialogue datasets.
The dialogue data admits a wide variety of responses for a given dialogue history, especially …

Augmenting multi-turn text-to-SQL datasets with self-play

Q Liu, Z Ye, T Yu, P Blunsom, L Song - arxiv preprint arxiv:2210.12096, 2022 - arxiv.org
The task of context-dependent text-to-SQL aims to convert multi-turn user utterances to
formal SQL queries. This is a challenging task due to both the scarcity of training data from …