Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
From show to tell: A survey on deep learning-based image captioning
Connecting Vision and Language plays an essential role in Generative Intelligence. For this
reason, large research efforts have been devoted to image captioning, ie describing images …
reason, large research efforts have been devoted to image captioning, ie describing images …
A comprehensive survey of deep learning for image captioning
Generating a description of an image is called image captioning. Image captioning requires
recognizing the important objects, their attributes, and their relationships in an image. It also …
recognizing the important objects, their attributes, and their relationships in an image. It also …
Survey of the state of the art in natural language generation: Core tasks, applications and evaluation
This paper surveys the current state of the art in Natural Language Generation (NLG),
defined as the task of generating text or speech from non-linguistic input. A survey of NLG is …
defined as the task of generating text or speech from non-linguistic input. A survey of NLG is …
Show and tell: Lessons learned from the 2015 mscoco image captioning challenge
Automatically describing the content of an image is a fundamental problem in artificial
intelligence that connects computer vision and natural language processing. In this paper …
intelligence that connects computer vision and natural language processing. In this paper …
Microsoft coco captions: Data collection and evaluation server
In this paper we describe the Microsoft COCO Caption dataset and evaluation server. When
completed, the dataset will contain over one and a half million captions describing over …
completed, the dataset will contain over one and a half million captions describing over …
Show, attend and tell: Neural image caption generation with visual attention
Inspired by recent work in machine translation and object detection, we introduce an
attention based model that automatically learns to describe the content of images. We …
attention based model that automatically learns to describe the content of images. We …
Deep visual-semantic alignments for generating image descriptions
We present a model that generates natural language descriptions of images and their
regions. Our approach leverages datasets of images and their sentence descriptions to …
regions. Our approach leverages datasets of images and their sentence descriptions to …
Long-term recurrent convolutional networks for visual recognition and description
Abstract Models comprised of deep convolutional network layers have dominated recent
image interpretation tasks; we investigate whether models which are also compositional, or" …
image interpretation tasks; we investigate whether models which are also compositional, or" …
Show and tell: A neural image caption generator
Automatically describing the content of an image is a fundamental problem in artificial
intelligence that connects computer vision and natural language processing. In this paper …
intelligence that connects computer vision and natural language processing. In this paper …
Unifying visual-semantic embeddings with multimodal neural language models
Inspired by recent advances in multimodal learning and machine translation, we introduce
an encoder-decoder pipeline that learns (a): a multimodal joint embedding space with …
an encoder-decoder pipeline that learns (a): a multimodal joint embedding space with …