Hybrid relation guided set matching for few-shot action recognition

X Wang, S Zhang, Z Qing, M Tang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Current few-shot action recognition methods reach impressive performance by learning
discriminative features for each video via episodic training and designing various temporal …

Molo: Motion-augmented long-short contrastive learning for few-shot action recognition

X Wang, S Zhang, Z Qing, C Gao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Current state-of-the-art approaches for few-shot action recognition achieve promising
performance by conducting frame-level matching on learned visual features. However, they …

Video action understanding

MS Hutchinson, VN Gadepally - IEEE Access, 2021 - ieeexplore.ieee.org
Many believe that the successes of deep learning on image understanding problems can be
replicated in the realm of video understanding. However, due to the scale and temporal …

Unimd: Towards unifying moment retrieval and temporal action detection

Y Zeng, Y Zhong, C Feng, L Ma - European Conference on Computer …, 2024 - Springer
Abstract Temporal Action Detection (TAD) focuses on detecting pre-defined actions, while
Moment Retrieval (MR) aims to identify the events described by open-ended natural …

Low-fidelity video encoder optimization for temporal action localization

M Xu, JM Perez Rua, X Zhu… - Advances in Neural …, 2021 - proceedings.neurips.cc
Most existing temporal action localization (TAL) methods rely on a transfer learning pipeline:
by first optimizing a video encoder on a large action classification dataset (ie, source …

Videoglue: Video general understanding evaluation of foundation models

L Yuan, NB Gundavarapu, L Zhao, H Zhou… - arxiv preprint arxiv …, 2023 - arxiv.org
We evaluate existing foundation models video understanding capabilities using a carefully
designed experiment protocol consisting of three hallmark tasks (action recognition …

Action sensitivity learning for temporal action localization

J Shao, X Wang, R Quan, J Zheng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Temporal action localization (TAL), which involves recognizing and locating action
instances, is a challenging task in video understanding. Most existing approaches directly …

Cross-domain few-shot action recognition with unlabeled videos

X Wang, S Zhang, Z Qing, Y Lv, C Gao… - Computer Vision and …, 2023 - Elsevier
Current few-shot action recognition approaches have achieved impressive performance
using only a few labeled examples. However, they usually assume the base (train) and …

Svip: Sequence verification for procedures in videos

Y Qian, W Luo, D Lian, X Tang… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this paper, we propose a novel sequence verification task that aims to distinguish positive
video pairs performing the same action sequence from negative ones with step-level …

Hr-pro: Point-supervised temporal action localization via hierarchical reliability propagation

H Zhang, X Wang, X Xu, Z Qing, C Gao… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Point-supervised Temporal Action Localization (PSTAL) is an emerging research direction
for label-efficient learning. However, current methods mainly focus on optimizing the network …