Prompting large language model for machine translation: A case study

B Zhang, B Haddow, A Birch - International Conference on …, 2023 - proceedings.mlr.press
Research on prompting has shown excellent performance with little or even no supervised
training across many tasks. However, prompting for machine translation is still under …

Machine translation meta evaluation through translation accuracy challenge sets

N Moghe, A Fazla, C Amrhein, T Kocmi… - Computational …, 2024 - direct.mit.edu
Recent machine translation (MT) metrics calibrate their effectiveness by correlating with
human judgment. However, these results are often obtained by averaging predictions across …

The use of residual analysis to improve the error rate accuracy of machine translation

Ľ Benko, D Munkova, M Munk, L Benkova, P Hajek - Scientific Reports, 2024 - nature.com
The aim of the study is to compare two different approaches to machine translation—
statistical and neural—using automatic MT metrics of error rate and residuals. We examined …

Breeding machine translations: Evolutionary approach to survive and thrive in the world of automated evaluation

J Jon, O Bojar - arxiv preprint arxiv:2305.19330, 2023 - arxiv.org
We propose a genetic algorithm (GA) based method for modifying n-best lists produced by a
machine translation (MT) system. Our method offers an innovative approach to improving MT …

Metric score landscape challenge (MSLC23): Understanding metrics' performance on a wider landscape of translation quality

C Lo, S Larkin, R Knowles - … of the Eighth Conference on Machine …, 2023 - aclanthology.org
Abstract The Metric Score Landscape Challenge (MSLC23) dataset aims to gain insight into
metric scores on a broader/wider landscape of machine translation (MT) quality. It provides a …

ACES: Translation accuracy challenge sets at WMT 2023

C Amrhein, N Moghe, L Guillou - arxiv preprint arxiv:2311.01153, 2023 - arxiv.org
We benchmark the performance of segmentlevel metrics submitted to WMT 2023 using the
ACES Challenge Set (Amrhein et al., 2022). The challenge set consists of 36K examples …

Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering

CH Park, K Lee, H Lim, J Kim, J Park… - Findings of the …, 2024 - aclanthology.org
Building a reliable visual question answering (VQA) system across different languages is a
challenging problem, primarily due to the lack of abundant samples for training. To address …

Reranking for natural language generation from logical forms: A study based on large language models

L Haroutunian, Z Li, L Galescu, P Cohen… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs) have demonstrated impressive capabilities in natural
language generation. However, their output quality can be inconsistent, posing challenges …

Evaluating automatic metrics with incremental machine translation systems

G Wu, SB Cohen, R Sennrich - arxiv preprint arxiv:2407.03277, 2024 - arxiv.org
We introduce a dataset comprising commercial machine translations, gathered weekly over
six years across 12 translation directions. Since human A/B testing is commonly used, we …

Latest Research in Data Augmentation for Low Resource Language Text Translation: A Review

AD Latief, A Jarin, Y Yaniasih, DIN Afra… - … , Informatics and its …, 2024 - ieeexplore.ieee.org
The translation of low-resource languages remains a significant challenge in Natural
Language Processing (NLP) due to the scarcity of high-quality parallel data for training …