Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
[HTML][HTML] Review of large vision models and visual prompt engineering
Visual prompt engineering is a fundamental methodology in the field of visual and image
artificial general intelligence. As the development of large vision models progresses, the …
artificial general intelligence. As the development of large vision models progresses, the …
Opera: Alleviating hallucination in multi-modal large language models via over-trust penalty and retrospection-allocation
Hallucination posed as a pervasive challenge of multi-modal large language models
(MLLMs) has significantly impeded their real-world usage that demands precise judgment …
(MLLMs) has significantly impeded their real-world usage that demands precise judgment …
A systematic survey of prompt engineering on vision-language foundation models
Prompt engineering is a technique that involves augmenting a large pre-trained model with
task-specific hints, known as prompts, to adapt the model to new tasks. Prompts can be …
task-specific hints, known as prompts, to adapt the model to new tasks. Prompts can be …
Multimodal prompt perceiver: Empower adaptiveness generalizability and fidelity for all-in-one image restoration
Despite substantial progress all-in-one image restoration (IR) grapples with persistent
challenges in handling intricate real-world degradations. This paper introduces MPerceiver …
challenges in handling intricate real-world degradations. This paper introduces MPerceiver …
Parameter-efficient fine-tuning for pre-trained vision models: A survey
Large-scale pre-trained vision models (PVMs) have shown great potential for adaptability
across various downstream vision tasks. However, with state-of-the-art PVMs growing to …
across various downstream vision tasks. However, with state-of-the-art PVMs growing to …
Dept: Decoupled prompt tuning
This work breaks through the Base-New Tradeoff (BNT) dilemma in prompt tuning ie the
better the tuned model generalizes to the base (or target) task the worse it generalizes to …
better the tuned model generalizes to the base (or target) task the worse it generalizes to …
Pro-tuning: Unified prompt tuning for vision tasks
In computer vision, fine-tuning is the de-facto approach to leverage pre-trained vision
models to perform downstream tasks. However, deploying it in practice is quite challenging …
models to perform downstream tasks. However, deploying it in practice is quite challenging …
Not all prompts are secure: A switchable backdoor attack against pre-trained vision transfomers
Given the power of vision transformers a new learning paradigm pre-training and then
prompting makes it more efficient and effective to address downstream visual recognition …
prompting makes it more efficient and effective to address downstream visual recognition …
Exploring autonomous agents through the lens of large language models: A review
S Barua - arxiv preprint arxiv:2404.04442, 2024 - arxiv.org
Large Language Models (LLMs) are transforming artificial intelligence, enabling
autonomous agents to perform diverse tasks across various domains. These agents …
autonomous agents to perform diverse tasks across various domains. These agents …
Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation
JJ Wu, ACH Chang, CY Chuang… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper addresses text-supervised semantic segmentation aiming to learn a model
capable of segmenting arbitrary visual concepts within images by using only image-text …
capable of segmenting arbitrary visual concepts within images by using only image-text …