- Academic Search

MDZ Hossain, F Sohel, MF Shiratuddin… - ACM Computing Surveys …, 2019 - dl.acm.org

Generating a description of an image is called image captioning. Image captioning requires
recognizing the important objects, their attributes, and their relationships in an image. It also …

Zapisz Cytuj Cytowane przez 1017 Powiązane artykuły Wszystkie wersje 8

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multimodal machine learning: A survey and taxonomy

T Baltrušaitis, C Ahuja… - IEEE transactions on …, 2018 - ieeexplore.ieee.org

Our experience of the world is multimodal-we see objects, hear sounds, feel texture, smell
odors, and taste flavors. Modality refers to the way in which something happens or is …

Zapisz Cytuj Cytowane przez 3901 Powiązane artykuły Wszystkie wersje 12

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Stacked cross attention for image-text matching

KH Lee, X Chen, G Hua, H Hu… - Proceedings of the …, 2018 - openaccess.thecvf.com

In this paper, we study the problem of image-text matching. Inferring the latent semantic
alignment between objects or other salient stuff (eg snow, sky, lawn) and the corresponding …

Zapisz Cytuj Cytowane przez 1461 Powiązane artykuły Wszystkie wersje 8 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multimodal transformer with multi-view visual representation for image captioning

J Yu, J Li, Z Yu, Q Huang - … on circuits and systems for video …, 2019 - ieeexplore.ieee.org

Image captioning aims to automatically generate a natural language description of a given
image, and most state-of-the-art models have adopted an encoder-decoder framework. The …

Zapisz Cytuj Cytowane przez 447 Powiązane artykuły Wszystkie wersje 5

[Free GPT-4]
[DeepSeek]

[PDF] jair.org

Survey of the state of the art in natural language generation: Core tasks, applications and evaluation

A Gatt, E Krahmer - Journal of Artificial Intelligence Research, 2018 - jair.org

This paper surveys the current state of the art in Natural Language Generation (NLG),
defined as the task of generating text or speech from non-linguistic input. A survey of NLG is …

Zapisz Cytuj Cytowane przez 1147 Powiązane artykuły Wszystkie wersje 15 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Spice: Semantic propositional image caption evaluation

P Anderson, B Fernando, M Johnson… - Computer Vision–ECCV …, 2016 - Springer

There is considerable interest in the task of automatically generating image captions.
However, evaluation is challenging. Existing automatic evaluation metrics are primarily …

Zapisz Cytuj Cytowane przez 2349 Powiązane artykuły Wszystkie wersje 13

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Towards diverse and natural image descriptions via a conditional gan

B Dai, S Fidler, R Urtasun, D Lin - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

Despite the substantial progress in recent years, the problem of image captioning remains
far from being satisfactorily tackled. Sentences produced by existing methods, eg those …

Zapisz Cytuj Cytowane przez 798 Powiązane artykuły Wszystkie wersje 10 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Show and tell: Lessons learned from the 2015 mscoco image captioning challenge

O Vinyals, A Toshev, S Bengio… - IEEE transactions on …, 2016 - ieeexplore.ieee.org

Automatically describing the content of an image is a fundamental problem in artificial
intelligence that connects computer vision and natural language processing. In this paper …

Zapisz Cytuj Cytowane przez 1153 Powiązane artykuły Wszystkie wersje 20

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Boosting image captioning with attributes

T Yao, Y Pan, Y Li, Z Qiu, T Mei - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

Automatically describing an image with a natural language has been an emerging
challenge in both fields of computer vision and natural language processing. In this paper …

Zapisz Cytuj Cytowane przez 848 Powiązane artykuły Wszystkie wersje 9 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] hw.ac.uk

Summarizing source code using a neural attention model

S Iyer, I Konstas, A Cheung… - 54th Annual Meeting …, 2016 - researchportal.hw.ac.uk

High quality source code is often paired with high level summaries of the computation it
performs, for example in code documentation or in descriptions posted in online forums …

Zapisz Cytuj Cytowane przez 893 Powiązane artykuły Wszystkie wersje 5 Wersja HTML

Utwórz alert

Cytuj

Szukanie zaawansowane

Zapisano w Mojej bibliotece

Language models for image captioning: The quirks and what works

A comprehensive survey of deep learning for image captioning

Multimodal machine learning: A survey and taxonomy

Stacked cross attention for image-text matching

Multimodal transformer with multi-view visual representation for image captioning

Survey of the state of the art in natural language generation: Core tasks, applications and evaluation

Spice: Semantic propositional image caption evaluation

Towards diverse and natural image descriptions via a conditional gan

Show and tell: Lessons learned from the 2015 mscoco image captioning challenge

Boosting image captioning with attributes

Summarizing source code using a neural attention model