Overview of the 8th workshop on Asian translation

T Nakazawa, H Nakayama, C Ding… - Proceedings of the …, 2021 - aclanthology.org
This paper presents the results of the shared tasks from the 8th workshop on Asian
translation (WAT2021). For the WAT2021, 28 teams participated in the shared tasks and 24 …

Exploring better text image translation with multimodal codebook

Z Lan, J Yu, X Li, W Zhang, J Luan, B Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Text image translation (TIT) aims to translate the source texts embedded in the image to
target translations, which has a wide range of applications and thus has important research …

Video-Helpful Multimodal Machine Translation

Y Li, S Shimizu, C Chu, S Kurohashi, W Li - arXiv preprint arXiv …, 2023 - arxiv.org
Existing multimodal machine translation (MMT) datasets consist of images and video
captions or instructional video subtitles, which rarely contain linguistic ambiguity, making …

BigVideo: A large-scale video subtitle translation dataset for multimodal machine translation

L Kang, L Huang, N Peng, P Zhu, Z Sun… - arXiv preprint arXiv …, 2023 - arxiv.org
We present a large-scale video subtitle translation dataset, BigVideo, to facilitate the study of
multi-modality machine translation. Compared with the widely used How2 and VaTeX …

Vision talks: Visual relationship-enhanced transformer for video-guided machine translation

S Chen, Y Zeng, D Cao, S Lu - Expert Systems with Applications, 2022 - Elsevier
Video-guided machine translation is a promising task which aims to translate a source
language description into a target language utilizing the video information as supplementary …

VISA: An ambiguous subtitles dataset for visual scene-aware machine translation

Y Li, S Shimizu, W Gu, C Chu, S Kurohashi - arXiv preprint arXiv …, 2022 - arxiv.org
Existing multimodal machine translation (MMT) datasets consist of images and video
captions or general subtitles, which rarely contain linguistic ambiguity, making visual …

TriFine: A Large-Scale Dataset of Vision-Audio-Subtitle for Tri-Modal Machine Translation and Benchmark with Fine-Grained Annotated Tags

B Guan, Y Zhang, Y Zhao, C Zong - Proceedings of the 31st …, 2025 - aclanthology.org
Current video-guided machine translation (VMT) approaches primarily use coarse-grained
visual information, resulting in information redundancy, high computational overhead, and …

Overview of the 10th Workshop on Asian Translation

T Nakazawa, K Kinugawa, H Mino, I Goto… - Proceedings of the …, 2023 - aclanthology.org
This paper presents the results of the shared tasks from the 10th workshop on Asian
translation (WAT2023). For the WAT2023, 2 teams submitted their translation results for the …

A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges

H Shen, L Shao, W Li, Z Lan, Z Liu, J Su - arXiv preprint arXiv:2405.12669, 2024 - arxiv.org
In recent years, multi-modal machine translation has attracted significant interest in both
academia and industry due to its superior performance. It takes both textual and visual …

Studies on Subword-based Low-Resource Neural Machine Translation: Segmentation, Encoding, and Decoding

S Haiyue - 2024 - repository.kulib.kyoto-u.ac.jp
In a world rich with diverse ideas and cultures, humans are isolated into islands of distinct
languages. Machine translation (MT) serves as a bridge, facilitating information access and …