A comprehensive survey of deep learning for image captioning

MDZ Hossain, F Sohel, MF Shiratuddin… - ACM Computing Surveys …, 2019‏ - dl.acm.org
Generating a description of an image is called image captioning. Image captioning requires
recognizing the important objects, their attributes, and their relationships in an image. It also …

Generation and comprehension of unambiguous object descriptions

J Mao, J Huang, A Toshev… - Proceedings of the …, 2016‏ - openaccess.thecvf.com
We propose a method that can generate an unambiguous description (known as a referring
expression) of a specific object or region in an image, and which can also comprehend or …

[PDF][PDF] Referitgame: Referring to objects in photographs of natural scenes

S Kazemzadeh, V Ordonez, M Matten… - Proceedings of the 2014 …, 2014‏ - aclanthology.org
In this paper we introduce a new game to crowd-source natural language referring
expressions. By designing a two player game, we can both collect and verify referring …

Computational generation of referring expressions: A survey

E Krahmer, K Van Deemter - Computational Linguistics, 2012‏ - direct.mit.edu
This article offers a survey of computational research on referring expression generation
(REG). It introduces the REG problem and describes early work in this area, discussing what …

[ספר][B] Computational models of referring: a study in cognitive science

K Van Deemter - 2016‏ - books.google.com
An argument that computational models can shed light on referring, a fundamental and
much-studied aspect of communication. To communicate, speakers need to make it clear …

Rethinking symbolic and visual context in Referring Expression Generation

S Schüz, A Gatt, S Zarrieß - Frontiers in Artificial Intelligence, 2023‏ - frontiersin.org
Situational context is crucial for linguistic reference to visible objects, since the same
description can refer unambiguously to an object in one context but be ambiguous or …

Caesar: An embodied simulator for generating multimodal referring expression datasets

MM Islam, R Mirzaiee, A Gladstone… - Advances in Neural …, 2022‏ - proceedings.neurips.cc
Humans naturally use verbal utterances and nonverbal gestures to refer to various objects
(known as $\textit {referring expressions} $) in different interactional scenarios. As collecting …

[PDF][PDF] Evaluating algorithms for the generation of referring expressions using a balanced corpus

A Gatt, I Van Der Sluis… - Proceedings of the 11th …, 2007‏ - research.rug.nl
Despite being the focus of intensive research, evaluation of algorithms that generate
referring expressions is still in its infancy. We describe a corpusbased evaluation …

[PDF][PDF] Generating expressions that refer to visible objects

M Mitchell, K Van Deemter, E Reiter - Proceedings of the 2013 …, 2013‏ - aclanthology.org
We introduce a novel algorithm for generating referring expressions, informed by human
and computer vision and designed to refer to visible objects. Our method separates absolute …

Learning in the rational speech acts model

W Monroe, C Potts - arxiv preprint arxiv:1510.06807, 2015‏ - arxiv.org
The Rational Speech Acts (RSA) model treats language use as a recursive process in which
probabilistic speaker and listener agents reason about each other's intentions to enrich the …