Finding function in form: Compositional character models for open vocabulary word representation
We introduce a model for constructing vector representations of words by composing
characters using bidirectional LSTMs. Relative to traditional word representation models that …
characters using bidirectional LSTMs. Relative to traditional word representation models that …
Efficient cluster-based k-nearest-neighbor machine translation
k-Nearest-Neighbor Machine Translation (kNN-MT) has been recently proposed as a non-
parametric solution for domain adaptation in neural machine translation (NMT). It aims to …
parametric solution for domain adaptation in neural machine translation (NMT). It aims to …
[PDF][PDF] Bilingually-constrained phrase embeddings for machine translation
Abstract We propose Bilingually-constrained Recursive Auto-encoders (BRAE) to learn
semantic phrase embeddings (compact vector representations for phrases), which can …
semantic phrase embeddings (compact vector representations for phrases), which can …
Efficient phrase table pruning for Hindi to English machine translation through syntactic and marker-based filtering and hybrid similarity measurement
N Chatterjee, S Gupta - Natural Language Engineering, 2019 - cambridge.org
For a given training corpus of parallel sentences, the quality of the output produced by a
translation system relies heavily on the underlying similarity measurement criteria. A phrase …
translation system relies heavily on the underlying similarity measurement criteria. A phrase …
Connecting phrase based statistical machine translation adaptation
Although more additional corpora are now available for Statistical Machine Translation
(SMT), only the ones which belong to the same or similar domains with the original corpus …
(SMT), only the ones which belong to the same or similar domains with the original corpus …
A relationship: Word alignment, phrase table, and translation quality
In the last years, researchers conducted several studies to evaluate the machine translation
quality based on the relationship between word alignments and phrase table. However …
quality based on the relationship between word alignments and phrase table. However …
[PDF][PDF] Dealing with data sparseness in SMT with factured models and morphological expansion: a case study on Croatian
This paper describes our experience using available linguistic resources for Croatian in
order to address data sparseness when building an English-to-Croatian general domain …
order to address data sparseness when building an English-to-Croatian general domain …
Divergence-based fine pruning of phrase-based statistical translation model
A widely used automatic translation approach, phrase-based statistical machine translation,
learns a probabilistic translation model composed of phrases from a large parallel corpus …
learns a probabilistic translation model composed of phrases from a large parallel corpus …
[PDF][PDF] Improving the precision of automatically constructed human-oriented translation dictionaries
A Antonova, A Misyurev - Proceedings of the 3rd workshop on …, 2014 - aclanthology.org
In this paper we address the problem of automatic acquisition of a human-oriented
translation dictionary from a large-scale parallel corpus. The initial translation equivalents …
translation dictionary from a large-scale parallel corpus. The initial translation equivalents …
Techniques for pruning phrase tables for statistical machine translation
R Zens, P Xu - US Patent 8,990,069, 2015 - Google Patents
A computer-implemented technique includes receiving, at a server including one or more
processors, a phrase table for statistical machine translation, the phrase table including a …
processors, a phrase table for statistical machine translation, the phrase table including a …