On the linguistic representational power of neural machine translation models
Despite the recent success of deep neural networks in natural language processing and
other spheres of artificial intelligence, their interpretability remains a challenge. We analyze …
Neural machine translation of rare words with subword units
R Sennrich - arXiv preprint arXiv:1508.07909, 2015 - research.ed.ac.uk
Neural machine translation (NMT) models typically operate with a fixed vocabulary, but
translation is an open-vocabulary problem. Previous work addresses the translation of out-of …
What do neural machine translation models learn about morphology?
Neural machine translation (MT) models obtain state-of-the-art performance while
maintaining a simple, end-to-end architecture. However, little is known about what these …
Between words and characters: A brief history of open-vocabulary modeling and tokenization in NLP
What are the units of text that we want to model? From bytes to multi-word expressions, text
can be analyzed and generated at many granularities. Until recently, most natural language …
Fast domain adaptation for neural machine translation
M Freitag, Y Al-Onaizan - arXiv preprint arXiv:1612.06897, 2016 - arxiv.org
Neural Machine Translation (NMT) is a new approach for automatic translation of text from
one human language into another. The basic concept in NMT is to train a large Neural …
Optimizing Chinese word segmentation for machine translation performance
Previous work has shown that Chinese word segmentation is useful for machine translation
to English, yet the way different segmentation strategies affect MT is still poorly understood …
Montreal neural machine translation systems for WMT'15
Neural machine translation (NMT) systems have recently achieved results comparable to the
state of the art on a few translation tasks, including English→French and English→German …
Improved statistical machine translation using paraphrases
Parallel corpora are crucial for training SMT systems. However, for many language pairs
they are available only in very limited quantities. For these language pairs a huge portion of …
Discovering salient neurons in deep NLP models
While a lot of work has been done in understanding representations learned within deep
NLP models and what knowledge they capture, work done towards analyzing individual …
CCOHA: Clean corpus of historical American English
Modelling language change is an increasingly important area of interest within the fields of
sociolinguistics and historical linguistics. In recent years, there has been a growing number …