Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Beyond english-centric multilingual machine translation
Existing work in translation demonstrated the potential of massively multilingual machine
translation by training a single model able to translate between any pair of languages …
translation by training a single model able to translate between any pair of languages …
Probabilistic topic modeling in multilingual settings: An overview of its methodology and applications
Probabilistic topic models are unsupervised generative models which model document
content as a two-step generation process, that is, documents are observed as mixtures of …
content as a two-step generation process, that is, documents are observed as mixtures of …
Wikimatrix: Mining 135m parallel sentences in 1620 language pairs from wikipedia
We present an approach based on multilingual sentence embeddings to automatically
extract parallel sentences from the content of Wikipedia articles in 85 languages, including …
extract parallel sentences from the content of Wikipedia articles in 85 languages, including …
ParaCrawl: Web-scale acquisition of parallel corpora
We report on methods to create the largest publicly available parallel corpora by crawling
the web, using open source software. We empirically compare alternative methods and …
the web, using open source software. We empirically compare alternative methods and …
Bitext alignment
J Tiedemann - 2011 - books.google.com
This book provides an overview of various techniques for the alignment of bitexts. It
describes general concepts and strategies that can be applied to map corresponding parts …
describes general concepts and strategies that can be applied to map corresponding parts …
A survey of domain adaptation for machine translation
Neural machine translation (NMT) is a deep learning based approach for machine
translation, which outperforms traditional statistical machine translation (SMT) and yields the …
translation, which outperforms traditional statistical machine translation (SMT) and yields the …
CCMatrix: Mining billions of high-quality parallel sentences on the web
We show that margin-based bitext mining in a multilingual sentence space can be applied to
monolingual corpora of billions of sentences. We are using ten snapshots of a curated …
monolingual corpora of billions of sentences. We are using ten snapshots of a curated …
Margin-based parallel corpus mining with multilingual sentence embeddings
Machine translation is highly sensitive to the size and quality of the training data, which has
led to an increasing interest in collecting and filtering large parallel corpora. In this paper, we …
led to an increasing interest in collecting and filtering large parallel corpora. In this paper, we …
Crowdsourcing and online collaborative translations
MA Jiménez-Crespo - 2017 - torrossa.com
We control the world basically because we are the only animals that can cooperate flexibly
in very large numbers […] This is something very unique to us, perhaps the most unique …
in very large numbers […] This is something very unique to us, perhaps the most unique …
Improving multilingual sentence embedding using bi-directional dual encoder with additive margin softmax
In this paper, we present an approach to learn multilingual sentence embeddings using a bi-
directional dual-encoder with additive margin softmax. The embeddings are able to achieve …
directional dual-encoder with additive margin softmax. The embeddings are able to achieve …