A comprehensive survey of deep learning for image captioning
Generating a description of an image is called image captioning. Image captioning requires
recognizing the important objects, their attributes, and their relationships in an image. It also …
recognizing the important objects, their attributes, and their relationships in an image. It also …
Generation and comprehension of unambiguous object descriptions
We propose a method that can generate an unambiguous description (known as a referring
expression) of a specific object or region in an image, and which can also comprehend or …
expression) of a specific object or region in an image, and which can also comprehend or …
[PDF][PDF] Referitgame: Referring to objects in photographs of natural scenes
In this paper we introduce a new game to crowd-source natural language referring
expressions. By designing a two player game, we can both collect and verify referring …
expressions. By designing a two player game, we can both collect and verify referring …
Computational generation of referring expressions: A survey
This article offers a survey of computational research on referring expression generation
(REG). It introduces the REG problem and describes early work in this area, discussing what …
(REG). It introduces the REG problem and describes early work in this area, discussing what …
[ספר][B] Computational models of referring: a study in cognitive science
K Van Deemter - 2016 - books.google.com
An argument that computational models can shed light on referring, a fundamental and
much-studied aspect of communication. To communicate, speakers need to make it clear …
much-studied aspect of communication. To communicate, speakers need to make it clear …
Rethinking symbolic and visual context in Referring Expression Generation
Situational context is crucial for linguistic reference to visible objects, since the same
description can refer unambiguously to an object in one context but be ambiguous or …
description can refer unambiguously to an object in one context but be ambiguous or …
Caesar: An embodied simulator for generating multimodal referring expression datasets
Humans naturally use verbal utterances and nonverbal gestures to refer to various objects
(known as $\textit {referring expressions} $) in different interactional scenarios. As collecting …
(known as $\textit {referring expressions} $) in different interactional scenarios. As collecting …
[PDF][PDF] Evaluating algorithms for the generation of referring expressions using a balanced corpus
Despite being the focus of intensive research, evaluation of algorithms that generate
referring expressions is still in its infancy. We describe a corpusbased evaluation …
referring expressions is still in its infancy. We describe a corpusbased evaluation …
[PDF][PDF] Generating expressions that refer to visible objects
We introduce a novel algorithm for generating referring expressions, informed by human
and computer vision and designed to refer to visible objects. Our method separates absolute …
and computer vision and designed to refer to visible objects. Our method separates absolute …
Learning in the rational speech acts model
The Rational Speech Acts (RSA) model treats language use as a recursive process in which
probabilistic speaker and listener agents reason about each other's intentions to enrich the …
probabilistic speaker and listener agents reason about each other's intentions to enrich the …