Statistical machine translation

A Lopez - ACM Computing Surveys (CSUR), 2008 - dl.acm.org
Statistical machine translation (SMT) treats the translation of natural language as a machine
learning problem. By examining many samples of human-produced translation, SMT …

Hierarchical phrase-based translation

D Chiang - computational linguistics, 2007 - direct.mit.edu
We present a statistical machine translation model that uses hierarchical phrases—phrases
that contain subphrases. The model is formally a synchronous context-free grammar but is …

[PDF][PDF] Posterior regularization for structured latent variable models

K Ganchev, J Graça, J Gillenwater, B Taskar - The Journal of Machine …, 2010 - jmlr.org
We present posterior regularization, a probabilistic framework for structured, weakly
supervised learning. Our framework efficiently incorporates indirect supervision via …

[PDF][PDF] Sentence simplification by monolingual machine translation

S Wubben, A Van Den Bosch… - Proceedings of the 50th …, 2012 - aclanthology.org
In this paper we describe a method for simplifying sentences using Phrase Based Machine
Translation, augmented with a re-ranking heuristic based on dissimilarity, and trained on a …

LexSym: Compositionality as lexical symmetry

E Akyürek, J Andreas - Proceedings of the 61st Annual Meeting of …, 2023 - aclanthology.org
In tasks like semantic parsing, instruction following, and question answering, standard deep
networks fail to generalize compositionally from small datasets. Many existing approaches …

compare-mt: A tool for holistic comparison of language generation systems

G Neubig, ZY Dou, J Hu, P Michel, D Pruthi… - arxiv preprint arxiv …, 2019 - arxiv.org
In this paper, we describe compare-mt, a tool for holistic analysis and comparison of the
results of systems for language generation tasks such as machine translation. The main goal …

Structured prediction cascades

D Weiss, B Taskar - Proceedings of the Thirteenth …, 2010 - proceedings.mlr.press
Structured prediction tasks pose a fundamental trade-off between the need for model
complexity to increase predictive power and the limited computational resources for …

Soft syntactic constraints for Arabic–English hierarchical phrase-based translation

Y Marton, D Chiang, P Resnik - Machine Translation, 2012 - Springer
In adding syntax to statistical machine translation, there is a tradeoff between taking
advantage of linguistic analysis and allowing the model to exploit parallel training data with …

[PDF][PDF] Hierarchical phrase-based translation with suffix arrays

A Lopez - Proceedings of the 2007 Joint Conference on Empirical …, 2007 - aclanthology.org
A major engineering challenge in statistical machine translation systems is the efficient
representation of extremely large translation rulesets. In phrase-based models, this problem …

A survey of word reordering in statistical machine translation: Computational models and language phenomena

A Bisazza, M Federico - Computational linguistics, 2016 - direct.mit.edu
Word reordering is one of the most difficult aspects of statistical machine translation (SMT),
and an important factor of its quality and efficiency. Despite the vast amount of research …