Neural machine translation: A review

F Stahlberg - Journal of Artificial Intelligence Research, 2020 - jair.org
The field of machine translation (MT), the automatic translation of written text from one
natural language into another, has experienced a major paradigm shift in recent years …

Exploring deep learning-based architecture, strategies, applications and current trends in generic object detection: A comprehensive review

L Aziz, MSBH Salam, UU Sheikh, S Ayub - Ieee Access, 2020 - ieeexplore.ieee.org
Object detection is a fundamental but challenging issue in the field of generic image
analysis; it plays an important role in a wide range of applications and has been receiving …

Wit: Wikipedia-based image text dataset for multimodal multilingual machine learning

K Srinivasan, K Raman, J Chen, M Bendersky… - Proceedings of the 44th …, 2021 - dl.acm.org
The milestone improvements brought about by deep representation learning and pre-
training techniques have led to large performance gains across downstream NLP, IR and …

Multi30k: Multilingual english-german image descriptions

D Elliott, S Frank, K Sima'an, L Specia - arxiv preprint arxiv:1605.00459, 2016 - arxiv.org
We introduce the Multi30K dataset to stimulate multilingual multimodal research. Recent
advances in image description have been demonstrated on English-language datasets …

Visual pivoting for (unsupervised) entity alignment

F Liu, M Chen, D Roth, N Collier - … of the AAAI conference on artificial …, 2021 - ojs.aaai.org
This work studies the use of visual semantic representations to align entities in
heterogeneous knowledge graphs (KGs). Images are natural components of many existing …

Findings of the second shared task on multimodal machine translation and multilingual image description

D Elliott, S Frank, L Barrault, F Bougares… - arxiv preprint arxiv …, 2017 - arxiv.org
We present the results from the second shared task on multimodal machine translation and
multilingual image description. Nine teams submitted 19 systems to two tasks. The …

A shared task on multimodal machine translation and crosslingual image description

L Specia, S Frank, K Sima'An… - First Conference on …, 2016 - research.ed.ac.uk
This paper introduces and summarises the findings of a new shared task at the intersection
of Natural Language Processing and Computer Vision: the generation of image descriptions …

Unpaired image captioning via scene graph alignments

J Gu, S Joty, J Cai, H Zhao, X Yang… - Proceedings of the …, 2019 - openaccess.thecvf.com
Most of current image captioning models heavily rely on paired image-caption datasets.
However, getting large scale image-caption paired data is labor-intensive and time …

Trends in integration of vision and language research: A survey of tasks, datasets, and methods

A Mogadala, M Kalimuthu, D Klakow - Journal of Artificial Intelligence …, 2021 - jair.org
Abstract Interest in Artificial Intelligence (AI) and its applications has seen unprecedented
growth in the last few years. This success can be partly attributed to the advancements made …

Findings of the third shared task on multimodal machine translation

L Barrault, F Bougares, L Specia, C Lala… - Third Conference on …, 2018 - hal.science
We present the results from the third shared task on multimodal machine translation. In this
task a source sentence in English is supplemented by an image and participating systems …