Achieving human parity on automatic chinese to english news translation

H Hassan, A Aue, C Chen, V Chowdhary… - arxiv preprint arxiv …, 2018 - arxiv.org
Machine translation has made rapid advances in recent years. Millions of people are using it
today in online translation systems and mobile applications in order to communicate across …

Summarizing source code using a neural attention model

S Iyer, I Konstas, A Cheung… - 54th Annual Meeting …, 2016 - researchportal.hw.ac.uk
High quality source code is often paired with high level summaries of the computation it
performs, for example in code documentation or in descriptions posted in online forums …

Revisiting low-resource neural machine translation: A case study

R Sennrich, B Zhang - arxiv preprint arxiv:1905.11901, 2019 - arxiv.org
It has been shown that the performance of neural machine translation (NMT) drops starkly in
low-resource conditions, underperforming phrase-based statistical machine translation …

Attention-over-attention neural networks for reading comprehension

Y Cui, Z Chen, S Wei, S Wang, T Liu, G Hu - arxiv preprint arxiv …, 2016 - arxiv.org
Cloze-style queries are representative problems in reading comprehension. Over the past
few months, we have seen much progress that utilizing neural network approach to solve …

[PDF][PDF] Farasa: A fast and furious segmenter for arabic

A Abdelali, K Darwish, N Durrani… - Proceedings of the 2016 …, 2016 - aclanthology.org
In this paper, we present Farasa, a fast and accurate Arabic segmenter. Our approach is
based on SVM-rank using linear kernels. We measure the performance of the segmenter in …

The iit bombay english-hindi parallel corpus

A Kunchukuttan, P Mehta, P Bhattacharyya - arxiv preprint arxiv …, 2017 - arxiv.org
We present the IIT Bombay English-Hindi Parallel Corpus. The corpus is a compilation of
parallel corpora previously available in the public domain as well as new parallel corpora …

On the impact of various types of noise on neural machine translation

H Khayrallah, P Koehn - arxiv preprint arxiv:1805.12282, 2018 - arxiv.org
We examine how various types of noise in the parallel training data impact the quality of
neural machine translation systems. We create five types of artificial noise and analyze how …

[PDF][PDF] Scalable modified Kneser-Ney language model estimation

K Heafield, I Pouzyrevsky, JH Clark… - Proceedings of the 51st …, 2013 - aclanthology.org
We present an efficient algorithm to estimate large modified Kneser-Ney models including
interpolation. Streaming and sorting enables the algorithm to scale to much larger models by …

A challenge set approach to evaluating machine translation

P Isabelle, C Cherry, G Foster - arxiv preprint arxiv:1704.07431, 2017 - arxiv.org
Neural machine translation represents an exciting leap forward in translation quality. But
what longstanding weaknesses does it resolve, and which remain? We address these …

What level of quality can neural machine translation attain on literary text?

A Toral, A Way - Translation quality assessment: From principles to …, 2018 - Springer
Given the rise of the new neural approach to machine translation (NMT) and its promising
performance on different text types, we assess the translation quality it can attain on what is …