Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Evolution of visual data captioning Methods, Datasets, and evaluation Metrics: A comprehensive survey
Abstract Automatic Visual Captioning (AVC) generates syntactically and semantically correct
sentences by describing important objects, attributes, and their relationships with each other …
sentences by describing important objects, attributes, and their relationships with each other …
A survey on advancements in image-text multimodal models: From general techniques to biomedical implementations
With the significant advancements of Large Language Models (LLMs) in the field of Natural
Language Processing (NLP), the development of image-text multimodal models has …
Language Processing (NLP), the development of image-text multimodal models has …
Adaptive path selection for dynamic image captioning
Image captioning is a challenging task, ie, given an image machine automatically generates
natural language that matches its semantic content and has attracted much attention in …
natural language that matches its semantic content and has attracted much attention in …
Split, embed and merge: An accurate table structure recognizer
Table structure recognition is an essential part for making machines understand tables. Its
main task is to recognize the internal structure of a table. However, due to the complexity …
main task is to recognize the internal structure of a table. However, due to the complexity …
Exploring fine-grained image-text alignment for referring remote sensing image segmentation
Given a language expression, referring remote sensing image segmentation (RRSIS) aims
to identify ground objects and assign pixelwise labels within the imagery. One of the key …
to identify ground objects and assign pixelwise labels within the imagery. One of the key …
Transformer-based local-global guidance for image captioning
Image captioning is a difficult problem for machine learning algorithms to compress huge
amounts of images into descriptive languages. The recurrent models are popularly used as …
amounts of images into descriptive languages. The recurrent models are popularly used as …
A multi-layer memory sharing network for video captioning
Over the past several years, video captioning has received much attention in computer
vision and machine learning communities. Many models utilize an RNN-based decoder to …
vision and machine learning communities. Many models utilize an RNN-based decoder to …
Protect, show, attend and tell: Empowering image captioning models with ownership protection
By and large, existing Intellectual Property (IP) protection on deep neural networks typically
i) focus on image classification task only, and ii) follow a standard digital watermarking …
i) focus on image classification task only, and ii) follow a standard digital watermarking …
Image captioning using transformer-based double attention network
Image captioning generates a human-like description for a query image, which has attracted
considerable attention recently. The most broadly utilized model for image description is an …
considerable attention recently. The most broadly utilized model for image description is an …
Divergent-convergent attention for image captioning
Attention mechanism has made great progress in image captioning, where semantic words
or local regions are selectively embedded into the language model. However, current …
or local regions are selectively embedded into the language model. However, current …