Google Scholar

Neural machine translation for low-resource languages: A survey

S Ranathunga, ESA Lee, M Prifti Skenduli… - ACM Computing …, 2023 - dl.acm.org

Neural Machine Translation (NMT) has seen tremendous growth in the last ten years since
the early 2000s and has already entered a mature phase. While considered the most widely …

Cited by 279. All 6 versions.

[PDF] jair.org

Repairing the cracked foundation: A survey of obstacles in evaluation practices for generated text

S Gehrmann, E Clark, T Sellam - Journal of Artificial Intelligence Research, 2023 - jair.org

Evaluation practices in natural language generation (NLG) have many known flaws,
but improved evaluation approaches are rarely widely adopted. This issue has become …

Cited by 158. All 6 versions.

[PDF] arxiv.org

No language left behind: Scaling human-centered machine translation

MR Costa-jussà, J Cross, O Çelebi, M Elbayad… - arXiv preprint arXiv …, 2022 - arxiv.org

Driven by the goal of eradicating language barriers on a global scale, machine translation
has solidified itself as a key focus of artificial intelligence research today. However, such …

Cited by 804. All 2 versions.

[PDF] jmlr.org

Beyond English-centric multilingual machine translation

A Fan, S Bhosale, H Schwenk, Z Ma, A El-Kishky… - Journal of Machine …, 2021 - jmlr.org

Existing work in translation demonstrated the potential of massively multilingual machine
translation by training a single model able to translate between any pair of languages …

Cited by 856. All 9 versions.

[PDF] uzh.ch

Findings of the 2019 conference on machine translation (WMT19)

L Barrault, O Bojar, MR Costa-Jussa, C Federmann… - 2019 - zora.uzh.ch

This paper presents the results of the premier shared task organized alongside the
Conference on Machine Translation (WMT) 2019. Participants were asked to build machine …

Cited by 776. All 13 versions.

[PDF] mit.edu

Survey of low-resource machine translation

B Haddow, R Bawden, AVM Barone, J Helcl… - Computational …, 2022 - direct.mit.edu

We present a survey covering the state of the art in low-resource machine translation (MT)
research. There are currently around 7,000 languages spoken in the world and almost all …

Cited by 175. All 13 versions.

[PDF] arxiv.org

WikiMatrix: Mining 135M parallel sentences in 1620 language pairs from Wikipedia

H Schwenk, V Chaudhary, S Sun, H Gong… - arXiv preprint arXiv …, 2019 - arxiv.org

We present an approach based on multilingual sentence embeddings to automatically
extract parallel sentences from the content of Wikipedia articles in 85 languages, including …

Cited by 367. All 5 versions.

[PDF] strath.ac.uk

ParaCrawl: Web-scale acquisition of parallel corpora

M Bañón, P Chen, B Haddow, K Heafield, H Hoang… - 2020 - strathprints.strath.ac.uk

We report on methods to create the largest publicly available parallel corpora by crawling
the web, using open source software. We empirically compare alternative methods and …

Cited by 274. All 17 versions.

[PDF] arxiv.org

Detecting hallucinated content in conditional neural sequence generation

C Zhou, G Neubig, J Gu, M Diab, P Guzman… - arXiv preprint arXiv …, 2020 - arxiv.org

Neural sequence models can generate highly fluent sentences, but recent studies have also
shown that they are also prone to hallucinate additional content not supported by the input …

Cited by 205. All 6 versions.

[PDF] arxiv.org

CCMatrix: Mining billions of high-quality parallel sentences on the web

H Schwenk, G Wenzek, S Edunov, E Grave… - arXiv preprint arXiv …, 2019 - arxiv.org

We show that margin-based bitext mining in a multilingual sentence space can be applied to
monolingual corpora of billions of sentences. We are using ten snapshots of a curated …

Cited by 241. All 5 versions.


Findings of the WMT 2019 shared task on parallel corpus filtering for low-resource conditions

Neural machine translation for low-resource languages: A survey‏
