Uncertainty in natural language processing: Sources, quantification, and applications

M Hu, Z Zhang, S Zhao, M Huang, B Wu - arxiv preprint arxiv:2306.04459, 2023 - arxiv.org
As a main field of artificial intelligence, natural language processing (NLP) has achieved
remarkable success via deep neural networks. Plenty of NLP tasks have been addressed in …

Domain adaptation and multi-domain adaptation for neural machine translation: A survey

D Saunders - Journal of Artificial Intelligence Research, 2022 - jair.org
The development of deep learning techniques has allowed Neural Machine Translation
(NMT) models to become extremely powerful, given sufficient training data and training time …

A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers

K Huang, F Mo, H Li, Y Li, Y Zhang, W Yi, Y Mao… - arxiv preprint arxiv …, 2024 - arxiv.org
The rapid development of Large Language Models (LLMs) demonstrates remarkable
multilingual capabilities in natural language processing, attracting global attention in both …

Unifying the convergences in multilingual neural machine translation

Y Huang, X Feng, X Geng, B Qin - arxiv preprint arxiv:2205.01620, 2022 - arxiv.org
Although all-in-one-model multilingual neural machine translation (multilingual NMT) has
achieved remarkable progress, the convergence inconsistency in the joint training is …

Towards higher pareto frontier in multilingual machine translation

Y Huang, X Feng, X Geng, B Li, B Qin - arxiv preprint arxiv:2305.15718, 2023 - arxiv.org
Multilingual neural machine translation has witnessed remarkable progress in recent years.
However, the long-tailed distribution of multilingual corpora poses a challenge of Pareto …

Dlut-nlp machine translation systems for wmt24 low-resource indic language translation

C Ju, J Liu, K Huang, D Huang - Proceedings of the Ninth …, 2024 - aclanthology.org
DLUT-NLP Machine Translation Systems for WMT24 Low-Resource Indic Language Translation
Page 1 Proceedings of the Ninth Conference on Machine Translation, pages 742–746 November …

Modeling Both Intra-and Inter-Modality Uncertainty for Multimodal Fake News Detection

L Wei, D Hu, W Zhou, S Hu - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Multimodal fake news detection has obtained increasing attention recently. Existing works
generally encode multimodal contents into a deterministic point in semantic subspaces, and …

Importance-Aware Data Augmentation for Document-Level Neural Machine Translation

M Wu, Y Wang, G Foster, L Qu, G Haffari - arxiv preprint arxiv:2401.15360, 2024 - arxiv.org
Document-level neural machine translation (DocNMT) aims to generate translations that are
both coherent and cohesive, in contrast to its sentence-level counterpart. However, due to its …

Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models

M Wu, TT Vu, L Qu, G Haffari - arxiv preprint arxiv:2406.08811, 2024 - arxiv.org
Large language models (LLMs) are typically fine-tuned on diverse and extensive datasets
sourced from various origins to develop a comprehensive range of skills, such as writing …

Optimizing the Training Schedule of Multilingual NMT using Reinforcement Learning

A Allemann, ÀR Atrio, A Popescu-Belis - arxiv preprint arxiv:2410.06118, 2024 - arxiv.org
Multilingual NMT is a viable solution for translating low-resource languages (LRLs) when
data from high-resource languages (HRLs) from the same language family is available …