Cross2StrA: Unpaired cross-lingual image captioning with cross-lingual cross-modal structure-pivoted alignment

S Wu, H Fei, W Ji, TS Chua - arxiv preprint arxiv:2305.12260, 2023 - arxiv.org
Unpaired cross-lingual image captioning has long suffered from irrelevancy and disfluency
issues, due to the inconsistencies of the semantic scene and syntax attributes during …

Fine-tuning large language models for domain-specific machine translation

J Zheng, H Hong, F Liu, X Wang, J Su, Y Liang… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) have shown great potential in domain-specific machine
translation (MT). However, one major issue is that LLMs pre-trained on general domain …

AI-based personality prediction for human well-being from text data: a systematic review

S Singh, W Singh - Multimedia Tools and Applications, 2024 - Springer
In recent years, people have preferred interacting on social media instead of physical
meetings. Researchers have explored social media text data to predict user personality …

Multi-domain neural machine translation with word-level domain context discrimination

J Zeng, J Su, H Wen, Y Liu, J **e, Y Yin… - Proceedings of the …, 2018 - aclanthology.org
With great practical value, the study of Multi-domain Neural Machine Translation (NMT)
mainly focuses on using mixed-domain parallel sentences to construct a unified model that …

Cross-lingual distillation for text classification

R Xu, Y Yang - arxiv preprint arxiv:1705.02073, 2017 - arxiv.org
Cross-lingual text classification (CLTC) is the task of classifying documents written in
different languages into the same taxonomy of categories. This paper presents a novel …

Bertalign: Improved word embedding-based sentence alignment for Chinese–English parallel corpora of literary texts

L Liu, M Zhu - Digital Scholarship in the Humanities, 2023 - academic.oup.com
Bertalign is designed to improve sentence alignment accuracy for Chinese–English parallel
corpora of literary texts. Aligning bilingual literary texts is not trivial, since most of the …

Pruning-then-expanding model for domain adaptation of neural machine translation

S Gu, Y Feng, W **e - arxiv preprint arxiv:2103.13678, 2021 - arxiv.org
Domain Adaptation is widely used in practical applications of neural machine translation,
which aims to achieve good performance on both the general-domain and in-domain …

Finding sparse structures for domain specific neural machine translation

J Liang, C Zhao, M Wang, X Qiu, L Li - Proceedings of the AAAI …, 2021 - ojs.aaai.org
Neural machine translation often adopts the fine-tuning approach to adapt to specific
domains. However, nonrestricted fine-tuning can easily degrade on the general domain and …

Arabic-English parallel corpus: A new resource for translation training and language teaching

HM Alotaibi - Arab World English Journal (AWEJ) Volume, 2017 - papers.ssrn.com
Parallel corpora can be defined as collections of aligned, translated texts of two or more
languages. They play a major role in translation and contrastive studies, and are also …

Exploring discriminative word-level domain contexts for multi-domain neural machine translation

J Su, J Zeng, J **e, H Wen, Y Yin… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
Owing to its practical significance, multi-domain Neural Machine Translation (NMT) has
attracted much attention recently. Recent studies mainly focus on constructing a unified NMT …