Improving neural machine translation with conditional sequence generative adversarial nets
This paper proposes an approach for applying GANs to NMT. We build a conditional
sequence generative adversarial net which comprises of two adversarial sub models, a …
sequence generative adversarial net which comprises of two adversarial sub models, a …
deltaBLEU: A discriminative metric for generation tasks with intrinsically diverse targets
We introduce Discriminative BLEU (deltaBLEU), a novel metric for intrinsic evaluation of
generated text in tasks that admit a diverse range of possible outputs. Reference strings are …
generated text in tasks that admit a diverse range of possible outputs. Reference strings are …
Learning paraphrastic sentence embeddings from back-translated bitext
We consider the problem of learning general-purpose, paraphrastic sentence embeddings
in the setting of Wieting et al.(2016b). We use neural machine translation to generate …
in the setting of Wieting et al.(2016b). We use neural machine translation to generate …
[PDF][PDF] The AMARA Corpus: Building Parallel Language Resources for the Educational Domain.
This paper presents the AMARA corpus of on-line educational content: a new parallel
corpus of educational video subtitles, multilingually aligned for 20 languages, ie 20 …
corpus of educational video subtitles, multilingually aligned for 20 languages, ie 20 …
Challenges of neural machine translation for short texts
Short texts (STs) present in a variety of scenarios, including query, dialog, and entity names.
Most of the exciting studies in neural machine translation (NMT) are focused on tackling …
Most of the exciting studies in neural machine translation (NMT) are focused on tackling …
It's MBR All the Way Down: Modern Generation Techniques Through the Lens of Minimum Bayes Risk
Minimum Bayes Risk (MBR) decoding is a method for choosing the outputs of a machine
learning system based not on the output with the highest probability, but the output with the …
learning system based not on the output with the highest probability, but the output with the …
End-to-end speech translation with transcoding by multi-task learning for distant language pairs
Directly translating spoken utterances from a source language to a target language is
challenging because it requires a fundamental transformation in both linguistic and para/non …
challenging because it requires a fundamental transformation in both linguistic and para/non …
An automatic evaluation metric for Ancient-Modern Chinese translation
As a written language used for thousands of years, Ancient Chinese has some special
characteristics like complex semantics as polysemy and the one-to-many alignment with …
characteristics like complex semantics as polysemy and the one-to-many alignment with …
A user-study on online adaptation of neural machine translation to human post-edits
The advantages of neural machine translation (NMT) have been extensively validated for
offline translation of several language pairs for different domains of spoken and written …
offline translation of several language pairs for different domains of spoken and written …
[PDF][PDF] Phrasal: A toolkit for new directions in statistical machine translation
We present a new version of Phrasal, an open-source toolkit for statistical phrasebased
machine translation. This revision includes features that support emerging research trends …
machine translation. This revision includes features that support emerging research trends …