Deep image captioning: A review of methods, trends and future challenges
Image captioning, also called report generation in medical field, aims to describe visual
content of images in human language, which requires to model semantic relationship …
Exploring deep learning-based architecture, strategies, applications and current trends in generic object detection: A comprehensive review
Object detection is a fundamental but challenging issue in the field of generic image
analysis; it plays an important role in a wide range of applications and has been receiving …
Graph neural networks: foundation, frontiers and applications
The field of graph neural networks (GNNs) has seen rapid and incredible strides over the
recent years. Graph neural networks, also known as deep learning on graphs, graph …
Attribute prototype network for zero-shot learning
From the beginning of zero-shot learning research, visual attributes have been shown to
play an important role. In order to better transfer attribute-based knowledge from known to …
Occlusion aware facial expression recognition using CNN with attention mechanism
Facial expression recognition in the wild is challenging due to various unconstrained
conditions. Although existing facial expression classifiers have been almost perfect on …
Transferable attention for domain adaptation
Recent work in domain adaptation bridges different domains by adversarially learning a
domain-invariant representation that cannot be distinguished by a domain discriminator …
High-resolution remote sensing image captioning based on structured attention
Automatically generating language descriptions of remote sensing images has become an
emerging research hot spot in the remote sensing field. Attention-based captioning, as a …
Bicro: Noisy correspondence rectification for multi-modality data via bi-directional cross-modal similarity consistency
As one of the most fundamental techniques in multimodal learning, cross-modal matching
aims to project various sensory modalities into a shared feature space. To achieve this …
Visual news: Benchmark and challenges in news image captioning
We propose Visual News Captioner, an entity-aware model for the task of news image
captioning. We also introduce Visual News, a large-scale benchmark consisting of more …
Global visual feature and linguistic state guided attention for remote sensing image captioning
Z Zhang, W Zhang, M Yan, X Gao… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
The encoder–decoder framework is prevalent in existing remote-sensing image captioning
(RSIC) models. The appearance of attention mechanisms brings significant results …