- Academic Search

C Zhang, C Zhang, S Zheng, Y Qiao, C Li… - arxiv preprint arxiv …, 2023 - arxiv.org

As ChatGPT goes viral, generative AI (AIGC, aka AI-generated content) has made headlines
everywhere because of its ability to analyze and create text, images, and beyond. With such …

Save Cite Cited by 206 Related articles All 4 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

From show to tell: A survey on deep learning-based image captioning

M Stefanini, M Cornia, L Baraldi… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Connecting Vision and Language plays an essential role in Generative Intelligence. For this
reason, large research efforts have been devoted to image captioning, ie describing images …

Save Cite Cited by 396 Related articles All 11 versions Free GPT-4

[Free GPT-4]

[PDF] thecvf.com

Meshed-memory transformer for image captioning

M Cornia, M Stefanini, L Baraldi… - Proceedings of the …, 2020 - openaccess.thecvf.com

Transformer-based architectures represent the state of the art in sequence modeling tasks
like machine translation and language understanding. Their applicability to multi-modal …

Save Cite Cited by 1202 Related articles All 13 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Grit: Faster and better image captioning transformer using dual visual features

VQ Nguyen, M Suganuma, T Okatani - European Conference on Computer …, 2022 - Springer

Current state-of-the-art methods for image captioning employ region-based features, as they
provide object-level information that is essential to describe the content of images; they are …

Save Cite Cited by 131 Related articles All 8 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Cptr: Full transformer network for image captioning

W Liu, S Chen, L Guo, X Zhu, J Liu - arxiv preprint arxiv:2101.10804, 2021 - arxiv.org

In this paper, we consider the image captioning task from a new sequence-to-sequence
prediction perspective and propose CaPtion TransformeR (CPTR) which takes the …

Save Cite Cited by 220 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] aaai.org

Improving image captioning by leveraging intra-and inter-layer global representation in transformer network

J Ji, Y Luo, X Sun, F Chen, G Luo, Y Wu… - Proceedings of the AAAI …, 2021 - ojs.aaai.org

Transformer-based architectures have shown great success in image captioning, where
object regions are encoded and then attended into the vectorial representations to guide the …

Save Cite Cited by 191 Related articles All 6 versions Free GPT-4 View as HTML

Region-aware image captioning via interaction learning

AA Liu, Y Zhai, N Xu, W Nie, W Li… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Image captioning is one of the primary goals in computer vision which aims to automatically
generate natural descriptions for images. Intuitively, human visual system can notice some …

Save Cite Cited by 116 Related articles

[Free GPT-4]

[PDF] arxiv.org

Image Captioning in news report scenario

T Liu, Q Cai, C Xu, B Hong, J **ong, Y Qiao… - arxiv preprint arxiv …, 2024 - arxiv.org

Image captioning strives to generate pertinent captions for specified images, situating itself
at the crossroads of Computer Vision (CV) and Natural Language Processing (NLP). This …

Save Cite Cited by 54 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] thecvf.com

Beyond a pre-trained object detector: Cross-modal textual and visual context for image captioning

CW Kuo, Z Kira - Proceedings of the IEEE/CVF conference …, 2022 - openaccess.thecvf.com

Significant progress has been made on visual captioning, largely relying on pre-trained
features and later fixed object detectors that serve as rich inputs to auto-regressive models …

Save Cite Cited by 76 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] jair.org Full View

Trends in integration of vision and language research: A survey of tasks, datasets, and methods

A Mogadala, M Kalimuthu, D Klakow - Journal of Artificial Intelligence …, 2021 - jair.org

Abstract Interest in Artificial Intelligence (AI) and its applications has seen unprecedented
growth in the last few years. This success can be partly attributed to the advancements made …

Save Cite Cited by 162 Related articles All 8 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Reflective decoding network for image captioning

A complete survey on generative ai (aigc): Is chatgpt from gpt-4 to gpt-5 all you need?

From show to tell: A survey on deep learning-based image captioning

Meshed-memory transformer for image captioning

Grit: Faster and better image captioning transformer using dual visual features

Cptr: Full transformer network for image captioning

Improving image captioning by leveraging intra-and inter-layer global representation in transformer network

Region-aware image captioning via interaction learning

Image Captioning in news report scenario

Beyond a pre-trained object detector: Cross-modal textual and visual context for image captioning

Trends in integration of vision and language research: A survey of tasks, datasets, and methods