Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Normalized and geometry-aware self-attention network for image captioning
Self-attention (SA) network has shown profound value in image captioning. In this paper, we
improve SA from two aspects to promote the performance of image captioning. First, we …
improve SA from two aspects to promote the performance of image captioning. First, we …
Trends in integration of vision and language research: A survey of tasks, datasets, and methods
Abstract Interest in Artificial Intelligence (AI) and its applications has seen unprecedented
growth in the last few years. This success can be partly attributed to the advancements made …
growth in the last few years. This success can be partly attributed to the advancements made …
Adaptive path selection for dynamic image captioning
Image captioning is a challenging task, ie, given an image machine automatically generates
natural language that matches its semantic content and has attracted much attention in …
natural language that matches its semantic content and has attracted much attention in …
Vision-enhanced and consensus-aware transformer for image captioning
Image captioning generates descriptions in a natural language for a given image. Due to its
great potential for a wide range of applications, many deep learning based-methods have …
great potential for a wide range of applications, many deep learning based-methods have …
Joint embedding of deep visual and semantic features for medical image report generation
Medical image report generation (MeIRG) aims at generating associated diagnosis
descriptions with natural language sentences from medical images, which is essential in the …
descriptions with natural language sentences from medical images, which is essential in the …
Prompt-based learning for unpaired image captioning
Unpaired Image Captioning (UIC) has been developed to learn image descriptions from
unaligned vision-language sample pairs. Existing works usually tackle this task using …
unaligned vision-language sample pairs. Existing works usually tackle this task using …
Visual cluster grounding for image captioning
Attention mechanisms have been extensively adopted in vision and language tasks such as
image captioning. It encourages a captioning model to dynamically ground appropriate …
image captioning. It encourages a captioning model to dynamically ground appropriate …
Image difference captioning with instance-level fine-grained feature representation
The task of image difference captioning aims at locating changed objects in similar image
pairs and describing the difference with natural language. The key challenges of this task …
pairs and describing the difference with natural language. The key challenges of this task …
Dual attention on pyramid feature maps for image captioning
Generating natural sentences from images is a fundamental learning task for visual-
semantic understanding in multimedia. In this paper, we propose to apply dual attention on …
semantic understanding in multimedia. In this paper, we propose to apply dual attention on …
Deep reinforcement polishing network for video captioning
W Xu, J Yu, Z Miao, L Wan, Y Tian… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
The video captioning task aims to describe video content using several natural-language
sentences. Although one-step encoder-decoder models have achieved promising progress …
sentences. Although one-step encoder-decoder models have achieved promising progress …