Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Madlad-400: A multilingual and document-level large audited dataset
We introduce MADLAD-400, a manually audited, general domain 3T token monolingual
dataset based on CommonCrawl, spanning 419 languages. We discuss the limitations …
dataset based on CommonCrawl, spanning 419 languages. We discuss the limitations …
Findings of the 2019 conference on machine translation (WMT19)
This paper presents the results of the premier shared task organized alongside the
Conference on Machine Translation (WMT) 2019. Participants were asked to build machine …
Conference on Machine Translation (WMT) 2019. Participants were asked to build machine …
Findings of the 2021 conference on machine translation (WMT21)
F Akhbardeh, A Arkhangorodsky, M Biesialska… - Proceedings of the sixth …, 2021 - cris.fbk.eu
This paper presents the results of the news translation task, the multilingual low-resource
translation for Indo-European languages, the triangular translation task, and the automatic …
translation for Indo-European languages, the triangular translation task, and the automatic …
Neural machine translation with byte-level subwords
Almost all existing machine translation models are built on top of character-based
vocabularies: characters, subwords or words. Rare characters from noisy text or character …
vocabularies: characters, subwords or words. Rare characters from noisy text or character …
Data augmentation using back-translation for context-aware neural machine translation
A Sugiyama, N Yoshinaga - … of the fourth workshop on discourse …, 2019 - aclanthology.org
A single sentence does not always convey information that is enough to translate it into other
languages. Some target languages need to add or specialize words that are omitted or …
languages. Some target languages need to add or specialize words that are omitted or …
MTNT: A testbed for machine translation of noisy text
Noisy or non-standard input text can cause disastrous mistranslations in most modern
Machine Translation (MT) systems, and there has been growing research interest in creating …
Machine Translation (MT) systems, and there has been growing research interest in creating …
Machine translation and its evaluation: a study
Abstract Machine translation (namely MT) has been one of the most popular fields in
computational linguistics and Artificial Intelligence (AI). As one of the most promising …
computational linguistics and Artificial Intelligence (AI). As one of the most promising …
Mural: multimodal, multitask retrieval across languages
Both image-caption pairs and translation pairs provide the means to learn deep
representations of and connections between languages. We use both types of pairs in …
representations of and connections between languages. We use both types of pairs in …
Findings of the first shared task on machine translation robustness
We share the findings of the first shared task on improving robustness of Machine
Translation (MT). The task provides a testbed representing challenges facing MT models …
Translation (MT). The task provides a testbed representing challenges facing MT models …
JParaCrawl: A large scale web-based English-Japanese parallel corpus
Recent machine translation algorithms mainly rely on parallel corpora. However, since the
availability of parallel corpora remains limited, only some resource-rich language pairs can …
availability of parallel corpora remains limited, only some resource-rich language pairs can …