From show to tell: A survey on deep learning-based image captioning
Connecting Vision and Language plays an essential role in Generative Intelligence. For this
reason, large research efforts have been devoted to image captioning, ie describing images …
reason, large research efforts have been devoted to image captioning, ie describing images …
Supervised learning of semantic classes for image annotation and retrieval
A probabilistic formulation for semantic image annotation and retrieval is proposed.
Annotation and retrieval are posed as classification problems where each class is defined …
Annotation and retrieval are posed as classification problems where each class is defined …
On social networks and collaborative recommendation
Social network systems, like last. fm, play a significant role in Web 2.0, containing large
amounts of multimedia-enriched data that are enhanced both by explicit user-provided …
amounts of multimedia-enriched data that are enhanced both by explicit user-provided …
Tag ranking
Social media sharing web sites like Flickr allow users to annotate images with free tags,
which significantly facilitate Web image search and organization. However, the tags …
which significantly facilitate Web image search and organization. However, the tags …
Semantic interdisciplinary evaluation of image captioning models
U Sirisha, B Sai Chandana - Cogent Engineering, 2022 - Taylor & Francis
In our day-to-day life, synchronizing vision and language aspects plays a crucial role in
solving various real-time challenges. Image captioning is one of them, and it aims to …
solving various real-time challenges. Image captioning is one of them, and it aims to …
Annosearch: Image auto-annotation by search
Although it has been studied for several years by computer vision and machine learning
communities, image annotation is still far from practical. In this paper, we present …
communities, image annotation is still far from practical. In this paper, we present …
Show, edit and tell: a framework for editing image captions
Most image captioning frameworks generate captions directly from images, learning a
map** from visual features to natural language. However, editing existing captions can be …
map** from visual features to natural language. However, editing existing captions can be …
Video search reranking through random walk over document-level context graph
Multimedia search over distributed sources often result in recurrent images or videos which
are manifested beyond the textual modality. To exploit such contextual patterns and keep …
are manifested beyond the textual modality. To exploit such contextual patterns and keep …
Unifying guilt-by-association approaches: Theorems and fast algorithms
If several friends of Smith have committed petty thefts, what would you say about Smith?
Most people would not be surprised if Smith is a hardened criminal. Guilt-by-association …
Most people would not be surprised if Smith is a hardened criminal. Guilt-by-association …
Annotating images by mining image search results
In this paper, we propose a novel attempt of model-free image annotation which annotates
images by mining their search results. It contains three steps: 1) the search process to …
images by mining their search results. It contains three steps: 1) the search process to …