Google Scholar

J Zhang, J Huang, S Jin, S Lu - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org

Most visual recognition studies rely heavily on crowd-labelled data in deep neural networks
(DNNs) training, and they usually train a DNN for each single visual recognition task …

Cited by 477 - All 11 versions

[PDF] thecvf.com

Dual memory networks: A versatile adaptation approach for vision-language models

Y Zhang, W Zhu, H Tang, Z Ma… - Proceedings of the …, 2024 - openaccess.thecvf.com

With the emergence of pre-trained vision-language models like CLIP, how to adapt them to
various downstream classification tasks has garnered significant attention in recent …

Cited by 23 - All 5 versions

[PDF] neurips.cc

GraphAdapter: Tuning vision-language models with dual knowledge graph

X Li, D Lian, Z Lu, J Bai, Z Chen… - Advances in Neural …, 2023 - proceedings.neurips.cc

Adapter-style efficient transfer learning (ETL) has shown excellent performance in the tuning
of vision-language models (VLMs) under the low-data regime, where only a few additional …

Cited by 57 - All 6 versions

[PDF] thecvf.com

Adapting visual-language models for generalizable anomaly detection in medical images

C Huang, A Jiang, J Feng, Y Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Recent advancements in large-scale visual-language pre-trained models have led to
significant progress in zero-/few-shot anomaly detection within natural image domains …

Cited by 28 - All 7 versions

[PDF] thecvf.com

Low-rank few-shot adaptation of vision-language models

M Zanella, I Ben Ayed - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Recent progress in the few-shot adaptation of Vision-Language Models (VLMs) has further
pushed their generalization capabilities at the expense of just a few labeled samples within …

Cited by 23 - All 5 versions

[PDF] thecvf.com

SG-Former: Self-guided transformer with evolving token reallocation

S Ren, X Yang, S Liu, X Wang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Vision Transformer has demonstrated impressive success across various vision tasks.
However, its heavy computation cost, which grows quadratically with respect to the token …

Cited by 43 - All 6 versions

[PDF] thecvf.com

Auxiliary tasks benefit 3D skeleton-based human motion prediction

C Xu, RT Tan, Y Tan, S Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com

Exploring spatial-temporal dependencies from observed motions is one of the core
challenges of human motion prediction. Previous methods mainly focus on dedicated …

Cited by 34 - All 7 versions

[PDF] thecvf.com

A closer look at the few-shot adaptation of large vision-language models

J Silva-Rodriguez, S Hajimiri… - Proceedings of the …, 2024 - openaccess.thecvf.com

Efficient transfer learning (ETL) is receiving increasing attention to adapt large pre-trained
language-vision models on downstream tasks with a few labeled samples. While significant …

Cited by 30 - All 8 versions

[PDF] thecvf.com

ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification

J Shi, C Li, T Gong, Y Zheng… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Multiple instance learning (MIL)-based framework has become the mainstream for
processing the whole slide image (WSI) with giga-pixel size and hierarchical image context …

Cited by 14 - All 4 versions

[PDF] thecvf.com

Efficient test-time adaptation of vision-language models

A Karmanov, D Guan, S Lu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Test-time adaptation with pre-trained vision-language models has attracted increasing
attention for tackling distribution shifts during the test time. Though prior studies have …

Cited by 30 - All 7 versions