Graphadapter: Tuning vision-language models with dual knowledge graph

X Li, D Lian, Z Lu, J Bai, Z Chen… - Advances in Neural …, 2024 - proceedings.neurips.cc
Adapter-style efficient transfer learning (ETL) has shown excellent performance in the tuning
of vision-language models (VLMs) under the low-data regime, where only a few additional …

Molo: Motion-augmented long-short contrastive learning for few-shot action recognition

X Wang, S Zhang, Z Qing, C Gao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Current state-of-the-art approaches for few-shot action recognition achieve promising
performance by conducting frame-level matching on learned visual features. However, they …

A comprehensive review of few-shot action recognition

Y Wanyan, X Yang, W Dong, C Xu - arxiv preprint arxiv:2407.14744, 2024 - arxiv.org
Few-shot action recognition aims to address the high cost and impracticality of manually
labeling complex and variable video data in action recognition. It requires accurately …

Multi-view distillation based on multi-modal fusion for few-shot action recognition (CLIP-MDMF)

F Guo, YK Wang, H Qi, W **, L Zhu, J Sun - Knowledge-Based Systems, 2024 - Elsevier
In recent years, the field of few-shot action recognition (FSAR) has garnered significant
attention. Although many methods primarily rely on mono-modal data, there is a growing …

Multimodal adaptation of clip for few-shot action recognition

J **ng, M Wang, X Hou, G Dai, J Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
Applying large-scale pre-trained visual models like CLIP to few-shot action recognition tasks
can benefit performance and efficiency. Utilizing the" pre-training, fine-tuning" paradigm …

Exploring sample relationship for few-shot classification

X Chen, W Wu, L Ma, X You, C Gao, N Sang, Y Shao - Pattern Recognition, 2025 - Elsevier
Few-shot classification (FSC) is a challenging problem, which aims to identify novel classes
with limited samples. Most existing methods employ vanilla transfer learning or episodic …

Frame Order Matters: A Temporal Sequence-Aware Model for Few-Shot Action Recognition

B Li, M Liu, G Wang, Y Yu - arxiv preprint arxiv:2408.12475, 2024 - arxiv.org
In this paper, we propose a novel Temporal Sequence-Aware Model (TSAM) for few-shot
action recognition (FSAR), which incorporates a sequential perceiver adapter into the pre …

Multimodal prototype-enhanced network for few-shot action recognition

X Ni, Y Liu, H Wen, Y Ji, J **ao, Y Yang - Proceedings of the 2024 …, 2024 - dl.acm.org
Current methods for few-shot action recognition mainly fall into the metric learning
framework following ProtoNet, which demonstrates the importance of prototypes. Although …

Task-Adapter: Task-specific Adaptation of Image Models for Few-shot Action Recognition

C Cao, Y Zhang, Y Yu, Q Lv, L Min… - Proceedings of the 32nd …, 2024 - dl.acm.org
Existing works in few-shot action recognition mostly fine-tune a pre-trained image model and
design sophisticated temporal alignment modules at feature level. However, simply fully fine …

MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition

H Qu, R Yan, X Shu, H Gao, P Huang… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent few-shot action recognition (FSAR) methods typically perform semantic matching on
learned discriminative features to achieve promising performance. However, most FSAR …