Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Vision-language models for vision tasks: A survey
Most visual recognition studies rely heavily on crowd-labelled data in deep neural networks
(DNNs) training, and they usually train a DNN for each single visual recognition task …
(DNNs) training, and they usually train a DNN for each single visual recognition task …
Dual memory networks: A versatile adaptation approach for vision-language models
With the emergence of pre-trained vision-language models like CLIP how to adapt them to
various downstream classification tasks has garnered significant attention in recent …
various downstream classification tasks has garnered significant attention in recent …
Graphadapter: Tuning vision-language models with dual knowledge graph
Adapter-style efficient transfer learning (ETL) has shown excellent performance in the tuning
of vision-language models (VLMs) under the low-data regime, where only a few additional …
of vision-language models (VLMs) under the low-data regime, where only a few additional …
Adapting visual-language models for generalizable anomaly detection in medical images
Recent advancements in large-scale visual-language pre-trained models have led to
significant progress in zero-/few-shot anomaly detection within natural image domains …
significant progress in zero-/few-shot anomaly detection within natural image domains …
Low-rank few-shot adaptation of vision-language models
Recent progress in the few-shot adaptation of Vision-Language Models (VLMs) has further
pushed their generalization capabilities at the expense of just a few labeled samples within …
pushed their generalization capabilities at the expense of just a few labeled samples within …
Sg-former: Self-guided transformer with evolving token reallocation
Vision Transformer has demonstrated impressive success across various vision tasks.
However, its heavy computation cost, which grows quadratically with respect to the token …
However, its heavy computation cost, which grows quadratically with respect to the token …
Auxiliary tasks benefit 3d skeleton-based human motion prediction
Exploring spatial-temporal dependencies from observed motions is one of the core
challenges of human motion prediction. Previous methods mainly focus on dedicated …
challenges of human motion prediction. Previous methods mainly focus on dedicated …
A closer look at the few-shot adaptation of large vision-language models
Efficient transfer learning (ETL) is receiving increasing attention to adapt large pre-trained
language-vision models on downstream tasks with a few labeled samples. While significant …
language-vision models on downstream tasks with a few labeled samples. While significant …
ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification
Multiple instance learning (MIL)-based framework has become the mainstream for
processing the whole slide image (WSI) with giga-pixel size and hierarchical image context …
processing the whole slide image (WSI) with giga-pixel size and hierarchical image context …
Efficient test-time adaptation of vision-language models
Test-time adaptation with pre-trained vision-language models has attracted increasing
attention for tackling distribution shifts during the test time. Though prior studies have …
attention for tackling distribution shifts during the test time. Though prior studies have …