A scenario-generic neural machine translation data augmentation method

X Liu, J He, M Liu, Z Yin, L Yin, W Zheng - Electronics, 2023‏ - mdpi.com
Amid the rapid advancement of neural machine translation, the challenge of data sparsity
has been a major obstacle. To address this issue, this study proposes a general data …

The neural machine translation models for the low-resource Kazakh–English language pair

V Karyukin, D Rakhimova, A Karibayeva… - PeerJ Computer …, 2023‏ - peerj.com
The development of the machine translation field was driven by people's need to
communicate with each other globally by automatically translating words, sentences, and …

Alibaba-Translate China's Submission for WMT 2022 Metrics Shared Task

Y Wan, K Bao, D Liu, B Yang, DF Wong… - arxiv preprint arxiv …, 2022‏ - arxiv.org
In this report, we present our submission to the WMT 2022 Metrics Shared Task. We build
our system based on the core idea of UNITE (Unified Translation Evaluation), which unifies …

Guiding ontology translation with hubness-aware translation memory

M Tian, F Giunchiglia, R Song, H Xu - Expert Systems with Applications, 2025‏ - Elsevier
Ontology, as the foundational architecture for knowledge representation, necessitates
multilingualization to facilitate cross-lingual knowledge sharing, posing challenges that …

Identifying light verb constructions in indonesian: a direct translation approach: a direct translation approach

DS Nugraha - International Journal of Language and Literary Studies, 2022‏ - mail.ijlls.org
This study aimed to identify light verb constructions (LVCs) in Indonesian based on machine
translation methods, namely binary translation or direct translation. Based on the method …

Domain adapted machine translation: What does catastrophic forgetting forget and why?

D Saunders, S DeNeefe - arxiv preprint arxiv:2412.17537, 2024‏ - arxiv.org
Neural Machine Translation (NMT) models can be specialized by domain adaptation, often
involving fine-tuning on a dataset of interest. This process risks catastrophic forgetting: rapid …

Enhancing Machine Translation Experiences with Multilingual Knowledge Graphs

S Conia, D Lee, M Li, UF Minhas, Y Li - Proceedings of the AAAI …, 2024‏ - ojs.aaai.org
Translating entity names, especially when a literal translation is not correct, poses a
significant challenge. Although Machine Translation (MT) systems have achieved …

PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for the Neural Processing of Portuguese

T Osório, B Leite, HL Cardoso, L Gomes… - arxiv preprint arxiv …, 2024‏ - arxiv.org
Leveraging research on the neural modelling of Portuguese, we contribute a collection of
datasets for an array of language processing tasks and a corresponding collection of fine …

Alibaba-Translate China's Submission for WMT 2022 Quality Estimation Shared Task

K Bao, Y Wan, D Liu, B Yang, W Lei, X He… - arxiv preprint arxiv …, 2022‏ - arxiv.org
In this paper, we present our submission to the sentence-level MQM benchmark at Quality
Estimation Shared Task, named UniTE (Unified Translation Evaluation). Specifically, our …

Effective approaches to neural query language identification

X Ren, B Yang, D Liu, H Zhang, X Lv, L Yao… - Computational …, 2022‏ - direct.mit.edu
Query language identification (Q-LID) plays a crucial role in a cross-lingual search engine.
There exist two main challenges in Q-LID:(1) insufficient contextual information in queries for …