Corpora in translation studies: An overview and some suggestions for future research
M Baker - Target. International Journal of Translation Studies, 1995 - jbe-platform.com
Corpus-based research has become widely accepted as a factor in improving the
performance of machine translation systems, and corpus-based terminology compilation is …
performance of machine translation systems, and corpus-based terminology compilation is …
Statistical machine translation
A Lopez - ACM Computing Surveys (CSUR), 2008 - dl.acm.org
Statistical machine translation (SMT) treats the translation of natural language as a machine
learning problem. By examining many samples of human-produced translation, SMT …
learning problem. By examining many samples of human-produced translation, SMT …
[PDF][PDF] The mathematics of statistical machine translation: Parameter estimation
We describe a series o, f five statistical models o, f the translation process and give
algorithms, for estimating the parameters o, f these models given a set o, f pairs o, f …
algorithms, for estimating the parameters o, f these models given a set o, f pairs o, f …
Accurate methods for the statistics of surprise and coincidence
T Dunning - Computational linguistics, 1994 - books.google.com
There has been a recent trend back towards the statistical analysis of text. This trend has
resulted in a number of researchers doing good work in information retrieval and natural …
resulted in a number of researchers doing good work in information retrieval and natural …
Information retrieval as statistical translation
A Berger, J Lafferty - ACM SIGIR Forum, 2017 - dl.acm.org
We propose a new probabilistic approach to information retrieval based upon the ideas and
methods of statistical machine translation. The central ingredient in this approach is a …
methods of statistical machine translation. The central ingredient in this approach is a …
The web as a parallel corpus
Parallel corpora have become an essential resource for work in multilingual natural
language processing. In this article, we report on our work using the STRAND system for …
language processing. In this article, we report on our work using the STRAND system for …
[PDF][PDF] An IR approach for translating new words from nonparallel, comparable texts
In recent years, there is a phenomenal growth in the amount of online text material available
from the greatest information repository known as the World Wide Web. Various traditional …
from the greatest information repository known as the World Wide Web. Various traditional …
[PDF][PDF] Termight: Identifying and translating technical terminology
We propose a semi-automatic tool, termight, that helps professional translators and
terminologists identify technical terms and their translations. The tool makes use of part-of …
terminologists identify technical terms and their translations. The tool makes use of part-of …
Models of translational equivalence among words
ID Melamed - Computational Linguistics, 2000 - direct.mit.edu
Parallel texts (bitexts) have properties that distinguish them from other kinds of parallel data.
First, most words translate to only one other word. Second, bitext correspondence is typically …
First, most words translate to only one other word. Second, bitext correspondence is typically …
[PDF][PDF] Extracting parallel sentences from comparable corpora using document level alignment
The quality of a statistical machine translation (SMT) system is heavily dependent upon the
amount of parallel sentences used in training. In recent years, there have been several …
amount of parallel sentences used in training. In recent years, there have been several …