CLIP in medical imaging: A comprehensive survey
Contrastive Language-Image Pre-training (CLIP), a simple yet effective pre-training
paradigm, successfully introduces text supervision to vision models. It has shown promising …
Cross-modal retrieval: a systematic review of methods and future directions
With the exponential surge in diverse multimodal data, traditional unimodal retrieval
methods struggle to meet the needs of users seeking access to data across various …
A survey on RAG meeting LLMs: Towards retrieval-augmented large language models
As one of the most advanced techniques in AI, Retrieval-Augmented Generation (RAG) can
offer reliable and up-to-date external knowledge, providing huge convenience for numerous …
Retrieval-augmented generation for AI-generated content: A survey
The development of Artificial Intelligence Generated Content (AIGC) has been facilitated by
advancements in model algorithms, scalable foundation model architectures, and the …
Retrieval-augmented multimodal language modeling
Recent multimodal models such as DALL-E and CM3 have achieved remarkable progress
in text-to-image and image-to-text generation. However, these models store all learned …
Wiki-LLaVA: Hierarchical retrieval-augmented generation for multimodal LLMs
Multimodal LLMs are the natural evolution of LLMs and enlarge their capabilities so as to
work beyond the pure textual modality. As research is being carried out to design novel …
MeaCap: Memory-augmented zero-shot image captioning
Zero-shot image captioning (IC) without well-paired image-text data can be categorized into
two main types: training-free and text-only-training methods. While both types integrate pre …
Transferable decoding with visual entities for zero-shot image captioning
Image-to-text generation aims to describe images using natural language. Recently, zero-
shot image captioning based on pre-trained vision-language models (VLMs) and large …
Visual-augmented dynamic semantic prototype for generative zero-shot learning
Generative Zero-shot learning (ZSL) learns a generator to synthesize visual samples for
unseen classes, which is an effective way to advance ZSL. However, existing generative …
Exploring diverse in-context configurations for image captioning
After discovering that Language Models (LMs) can be good in-context few-shot
learners, numerous strategies have been proposed to optimize in-context sequence …