On the linguistic representational power of neural machine translation models

Y Belinkov, N Durrani, F Dalvi, H Sajjad… - Computational …, 2020 - direct.mit.edu
Despite the recent success of deep neural networks in natural language processing and
other spheres of artificial intelligence, their interpretability remains a challenge. We analyze …

Neural machine translation of rare words with subword units

R Sennrich - arXiv preprint arXiv:1508.07909, 2015 - research.ed.ac.uk
Neural machine translation (NMT) models typically operate with a fixed vocabulary, but
translation is an open-vocabulary problem. Previous work addresses the translation of out-of …

What do neural machine translation models learn about morphology?

Y Belinkov, N Durrani, F Dalvi, H Sajjad… - arXiv preprint arXiv …, 2017 - arxiv.org
Neural machine translation (MT) models obtain state-of-the-art performance while
maintaining a simple, end-to-end architecture. However, little is known about what these …

Between words and characters: A brief history of open-vocabulary modeling and tokenization in NLP

SJ Mielke, Z Alyafeai, E Salesky, C Raffel… - arXiv preprint arXiv …, 2021 - arxiv.org
What are the units of text that we want to model? From bytes to multi-word expressions, text
can be analyzed and generated at many granularities. Until recently, most natural language …

Fast domain adaptation for neural machine translation

M Freitag, Y Al-Onaizan - arXiv preprint arXiv:1612.06897, 2016 - arxiv.org
Neural Machine Translation (NMT) is a new approach for automatic translation of text from
one human language into another. The basic concept in NMT is to train a large Neural …

Optimizing Chinese word segmentation for machine translation performance

PC Chang, M Galley, CD Manning - Proceedings of the third …, 2008 - aclanthology.org
Previous work has shown that Chinese word segmentation is useful for machine translation
to English, yet the way different segmentation strategies affect MT is still poorly understood …

Montreal neural machine translation systems for WMT'15

S Jean, O Firat, K Cho, R Memisevic… - Proceedings of the …, 2015 - aclanthology.org
Neural machine translation (NMT) systems have recently achieved results comparable to the
state of the art on a few translation tasks, including English→French and English→German …

Improved statistical machine translation using paraphrases

C Callison-Burch, P Koehn… - Proceedings of the Human …, 2006 - aclanthology.org
Parallel corpora are crucial for training SMT systems. However, for many language pairs
they are available only in very limited quantities. For these language pairs a huge portion of …

Discovering salient neurons in deep NLP models

N Durrani, F Dalvi, H Sajjad - Journal of Machine Learning Research, 2023 - jmlr.org
While a lot of work has been done in understanding representations learned within deep
NLP models and what knowledge they capture, work done towards analyzing individual …

CCOHA: Clean corpus of historical American English

R Alatrash, D Schlechtweg, J Kuhn… - Proceedings of the …, 2020 - aclanthology.org
Modelling language change is an increasingly important area of interest within the fields of
sociolinguistics and historical linguistics. In recent years, there has been a growing number …