Automatic translation alignment for ancient greek and latin
This paper presents the results of automatic translation alignment experiments on a corpus
of texts in Ancient Greek translated into Latin. We used a state-of-the-art alignment workflow …
of texts in Ancient Greek translated into Latin. We used a state-of-the-art alignment workflow …
Grapheme-to-phoneme transduction for cross-language ASR
M Hasegawa-Johnson, L Rolston… - … on Statistical Language …, 2020 - Springer
Automatic speech recognition (ASR) can be deployed in a previously unknown language, in
less than 24 h, given just three resources: an acoustic model trained on other languages, a …
less than 24 h, given just three resources: an acoustic model trained on other languages, a …
Tilde's machine translation systems for WMT 2018
The paper describes the development process of the Tilde's NMT systems that were
submitted for the WMT 2018 shared task on news translation. We describe the data filtering …
submitted for the WMT 2018 shared task on news translation. We describe the data filtering …
Tilde's parallel corpus filtering methods for WMT 2018
M Pinnis - Proceedings of the Third Conference on Machine …, 2018 - aclanthology.org
The paper describes parallel corpus filtering methods that allow reducing noise of noisy
“parallel” corpora from a level where the corpora are not usable for neural machine …
“parallel” corpora from a level where the corpora are not usable for neural machine …
Integrated parallel sentence and fragment extraction from comparable corpora: A case study on Chinese--Japanese Wikipedia
Parallel corpora are crucial for statistical machine translation (SMT); however, they are quite
scarce for most language pairs and domains. As comparable corpora are far more available …
scarce for most language pairs and domains. As comparable corpora are far more available …
Multilingual topic models for bilingual dictionary extraction
A machine-readable bilingual dictionary plays a crucial role in many natural language
processing tasks, such as statistical machine translation and cross-language information …
processing tasks, such as statistical machine translation and cross-language information …
[PDF][PDF] Collection of bilingual data for lexicon transfer learning
L Rolston, K Kirchhoff - Univ. Washington, Seattle …, 2016 - vannevar.ece.uw.edu
This technical report describes the collection and format of a dataset of bilingual lexicons for
50 languages, undertaken as part of the DARPA LORELEI project on develo** language …
50 languages, undertaken as part of the DARPA LORELEI project on develo** language …
[PDF][PDF] Assigning terms to domains by document classification
In this paper we investigate a number of questions relating to the identification of the domain
of a term by domain classification of the document in which the term occurs. We propose and …
of a term by domain classification of the document in which the term occurs. We propose and …
[PDF][PDF] Visual Evaluation of Translation Alignment Data.
Translation alignment plays a crucial role in various applications in natural language
processing and digital humanities. With the recent advance in neural machine translation …
processing and digital humanities. With the recent advance in neural machine translation …
Building a multi-domain comparable corpus using a learning to rank method
Comparable corpora are key translation resources for both languages and domains with
limited linguistic resources. The existing approaches for building comparable corpora are …
limited linguistic resources. The existing approaches for building comparable corpora are …