Multimodal chain-of-thought reasoning in language models

Z Zhang, A Zhang, M Li, H Zhao, G Karypis… - ar…
… multimodal machine translation (MMT)
systems that enhance neural machine translation (NMT) with visual knowledge. This …

Valhalla: Visual hallucination for machine translation

Y Li, R Panda, Y Kim, CFR Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
Designing better machine translation systems by considering auxiliary inputs such as
images has attracted much attention in recent years. While existing methods show promising …