- Academic Search

T Nakazawa, H Nakayama, C Ding… - Proceedings of the …, 2021 - aclanthology.org

This paper presents the results of the shared tasks from the 8th workshop on Asian
translation (WAT2021). For the WAT2021, 28 teams participated in the shared tasks and 24 …

保存引用被引用数: 178 関連記事全 15 バージョン HTMLバージョン

[Free GPT-4]

[PDF] arxiv.org

Exploring better text image translation with multimodal codebook

Z Lan, J Yu, X Li, W Zhang, J Luan, B Wang… - arxiv preprint arxiv …, 2023 - arxiv.org

Text image translation (TIT) aims to translate the source texts embedded in the image to
target translations, which has a wide range of applications and thus has important research …

保存引用被引用数: 14 関連記事全 5 バージョン HTMLバージョン

[Free GPT-4]

[PDF] arxiv.org

Video-Helpful Multimodal Machine Translation

Y Li, S Shimizu, C Chu, S Kurohashi, W Li - arxiv preprint arxiv …, 2023 - arxiv.org

Existing multimodal machine translation (MMT) datasets consist of images and video
captions or instructional video subtitles, which rarely contain linguistic ambiguity, making …

保存引用被引用数: 3 関連記事全 5 バージョン HTMLバージョン

[Free GPT-4]

[PDF] arxiv.org

Bigvideo: A large-scale video subtitle translation dataset for multimodal machine translation

L Kang, L Huang, N Peng, P Zhu, Z Sun… - arxiv preprint arxiv …, 2023 - arxiv.org

We present a large-scale video subtitle translation dataset, BigVideo, to facilitate the study of
multi-modality machine translation. Compared with the widely used How2 and VaTeX …

保存引用被引用数: 7 関連記事全 3 バージョン HTMLバージョン

Vision talks: Visual relationship-enhanced transformer for video-guided machine translation

S Chen, Y Zeng, D Cao, S Lu - Expert Systems with Applications, 2022 - Elsevier

Video-guided machine translation is a promising task which aims to translate a source
language description into a target language utilizing the video information as supplementary …

保存引用被引用数: 2 関連記事全 2 バージョン

[Free GPT-4]

[PDF] arxiv.org

Visa: An ambiguous subtitles dataset for visual scene-aware machine translation

Y Li, S Shimizu, W Gu, C Chu, S Kurohashi - arxiv preprint arxiv …, 2022 - arxiv.org

Existing multimodal machine translation (MMT) datasets consist of images and video
captions or general subtitles, which rarely contain linguistic ambiguity, making visual …

保存引用被引用数: 8 関連記事全 5 バージョン HTMLバージョン

[Free GPT-4]

[PDF] aclanthology.org

TriFine: A Large-Scale Dataset of Vision-Audio-Subtitle for Tri-Modal Machine Translation and Benchmark with Fine-Grained Annotated Tags

B Guan, Y Zhang, Y Zhao, C Zong - Proceedings of the 31st …, 2025 - aclanthology.org

Current video-guided machine translation (VMT) approaches primarily use coarse-grained
visual information, resulting in information redundancy, high computational overhead, and …

保存引用関連記事全 2 バージョン HTMLバージョン

[Free GPT-4]

[PDF] aclanthology.org

Overview of the 10th Workshop on Asian Translation

T Nakazawa, K Kinugawa, H Mino, I Goto… - Proceedings of the …, 2023 - aclanthology.org

This paper presents the results of the shared tasks from the 10th workshop on Asian
translation (WAT2023). For the WAT2023, 2 teams submitted their translation results for the …

保存引用被引用数: 32 関連記事全 8 バージョン HTMLバージョン

[Free GPT-4]

[PDF] arxiv.org

A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges

H Shen, L Shao, W Li, Z Lan, Z Liu, J Su - arxiv preprint arxiv:2405.12669, 2024 - arxiv.org

In recent years, multi-modal machine translation has attracted significant interest in both
academia and industry due to its superior performance. It takes both textual and visual …

保存引用被引用数: 2 関連記事全 2 バージョン HTMLバージョン

[Free GPT-4]

[PDF] kyoto-u.ac.jp

[PDF][PDF] Studies on Subword-based Low-Resource Neural Machine Translation: Segmentation, Encoding, and Decoding

S Haiyue - 2024 - repository.kulib.kyoto-u.ac.jp

In a world rich with diverse ideas and cultures, humans are isolated into islands of distinct
languages. Machine translation (MT) serves as a bridge, facilitating information access and …

保存引用関連記事 HTMLバージョン

アラートを作成

引用

検索オプション

マイライブラリに保存しました

Video-guided machine translation with spatial hierarchical attention network

Overview of the 8th workshop on Asian translation

Exploring better text image translation with multimodal codebook

Video-Helpful Multimodal Machine Translation

Bigvideo: A large-scale video subtitle translation dataset for multimodal machine translation

Vision talks: Visual relationship-enhanced transformer for video-guided machine translation

Visa: An ambiguous subtitles dataset for visual scene-aware machine translation

TriFine: A Large-Scale Dataset of Vision-Audio-Subtitle for Tri-Modal Machine Translation and Benchmark with Fine-Grained Annotated Tags

Overview of the 10th Workshop on Asian Translation

A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges

[PDF][PDF] Studies on Subword-based Low-Resource Neural Machine Translation: Segmentation, Encoding, and Decoding