A review on explainability in multimodal deep neural nets
Artificial Intelligence techniques powered by deep neural nets have achieved much success
in several application domains, most significantly and notably in the Computer Vision …
Visualization and visual analytics approaches for image and video datasets: A survey
Image and video data analysis has become an increasingly important research area with
applications in different domains such as security surveillance, healthcare, augmented and …
Multimodal few-shot learning with frozen language models
When trained at sufficient scale, auto-regressive language models exhibit the notable ability
to learn a new language task after being prompted with just a few examples. Here, we …
Evaluation of text generation: A survey
The paper surveys evaluation methods of natural language generation (NLG) systems that
have been developed in the last few years. We group NLG evaluation methods into three …
Learning with noisy correspondence for cross-modal matching
Cross-modal matching, which aims to establish the correspondence between two different
modalities, is fundamental to a variety of tasks such as cross-modal retrieval and vision-and …
BiCro: Noisy correspondence rectification for multi-modality data via bi-directional cross-modal similarity consistency
As one of the most fundamental techniques in multimodal learning, cross-modal matching
aims to project various sensory modalities into a shared feature space. To achieve this …
A review of deep learning for video captioning
Video captioning (VC) is a fast-moving, cross-disciplinary area of research that comprises
contributions from domains such as computer vision, natural language processing …
Video description: A comprehensive survey of deep learning approaches
Video description refers to understanding visual content and transforming that acquired
understanding into automatic textual narration. It bridges the key AI fields of computer vision …
Generative AI in mobile networks: a survey
This paper provides a comprehensive review of recent challenges and results in the field of
generative AI with application to mobile telecommunications networks. The objective is to …
PSNet: Parallel symmetric network for video salient object detection
For the video salient object detection (VSOD) task, how to excavate the information from the
appearance modality and the motion modality has always been a topic of great concern. The …