Corpora in translation studies: An overview and some suggestions for future research

M Baker - Target. International Journal of Translation Studies, 1995 - jbe-platform.com
Corpus-based research has become widely accepted as a factor in improving the
performance of machine translation systems, and corpus-based terminology compilation is …

Statistical machine translation

A Lopez - ACM Computing Surveys (CSUR), 2008 - dl.acm.org
Statistical machine translation (SMT) treats the translation of natural language as a machine
learning problem. By examining many samples of human-produced translation, SMT …

[PDF][PDF] The mathematics of statistical machine translation: Parameter estimation

PF Brown, SA Della Pietra, VJ Della Pietra… - Computational …, 1993 - aclanthology.org
We describe a series o, f five statistical models o, f the translation process and give
algorithms, for estimating the parameters o, f these models given a set o, f pairs o, f …

Accurate methods for the statistics of surprise and coincidence

T Dunning - Computational linguistics, 1994 - books.google.com
There has been a recent trend back towards the statistical analysis of text. This trend has
resulted in a number of researchers doing good work in information retrieval and natural …

Information retrieval as statistical translation

A Berger, J Lafferty - ACM SIGIR Forum, 2017 - dl.acm.org
We propose a new probabilistic approach to information retrieval based upon the ideas and
methods of statistical machine translation. The central ingredient in this approach is a …

The web as a parallel corpus

P Resnik, NA Smith - Computational Linguistics, 2003 - direct.mit.edu
Parallel corpora have become an essential resource for work in multilingual natural
language processing. In this article, we report on our work using the STRAND system for …

[PDF][PDF] An IR approach for translating new words from nonparallel, comparable texts

P Fung, LY Yee - 36th Annual Meeting of the Association for …, 1998 - aclanthology.org
In recent years, there is a phenomenal growth in the amount of online text material available
from the greatest information repository known as the World Wide Web. Various traditional …

[PDF][PDF] Termight: Identifying and translating technical terminology

I Dagan, K Church - Fourth Conference on Applied Natural …, 1994 - aclanthology.org
We propose a semi-automatic tool, termight, that helps professional translators and
terminologists identify technical terms and their translations. The tool makes use of part-of …

Models of translational equivalence among words

ID Melamed - Computational Linguistics, 2000 - direct.mit.edu
Parallel texts (bitexts) have properties that distinguish them from other kinds of parallel data.
First, most words translate to only one other word. Second, bitext correspondence is typically …

[PDF][PDF] Extracting parallel sentences from comparable corpora using document level alignment

J Smith, C Quirk, K Toutanova - … of the North American chapter of …, 2010 - aclanthology.org
The quality of a statistical machine translation (SMT) system is heavily dependent upon the
amount of parallel sentences used in training. In recent years, there have been several …