Video test-time adaptation for action recognition

W Lin, MJ Mirza, M Kozinski… - Proceedings of the …, 2023 - openaccess.thecvf.com
Although action recognition systems can achieve top performance when evaluated on in-
distribution test points, they are vulnerable to unanticipated distribution shifts in test data …

No more shortcuts: Realizing the potential of temporal self-supervision

IR Dave, S Jenni, M Shah - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Self-supervised approaches for video have shown impressive results in video
understanding tasks. However, unlike early works that leverage temporal self-supervision …

Foundation models for video understanding: A survey

N Madan, A Møgelmose, R Modi, YS Rawat… - Authorea …, 2024 - techrxiv.org
Video Foundation Models (ViFMs) aim to develop general-purpose representations for
various video understanding tasks by leveraging large-scale datasets and powerful models …

Benchmarking the robustness of temporal action detection models against temporal corruptions

R Zeng, X Chen, J Liang, H Wu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Temporal action detection (TAD) aims to locate action positions and recognize action
categories in long-term untrimmed videos. Although many methods have achieved …

Eventtransact: A video transformer-based framework for event-camera based action recognition

T de Blegiers, IR Dave, A Yousaf… - 2023 IEEE/RSJ …, 2023 - ieeexplore.ieee.org
Recognizing and comprehending human actions and gestures is a crucial perception
requirement for robots to interact with humans and carry out tasks in diverse domains …

Uncovering the hidden dynamics of video self-supervised learning under distribution shifts

P Sarkar, A Beirami, A Etemad - Advances in Neural …, 2023 - proceedings.neurips.cc
Video self-supervised learning (VSSL) has made significant progress in recent years.
However, the exact behavior and dynamics of these models under different forms of …

Exploring motion cues for video test-time adaptation

R Zeng, Q Deng, H Xu, S Niu, J Chen - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Test-time adaptation (TTA) aims at boosting the generalization capability of a trained model
by conducting self-/un-supervised learning during testing in real-world applications. Though …

SMART-vision: survey of modern action recognition techniques in vision

AK AlShami, R Rabinowitz, K Lam, Y Shleibik… - Multimedia Tools and …, 2024 - Springer
Abstract Human Action Recognition (HAR) is a challenging domain in computer vision,
involving recognizing complex patterns by analyzing the spatiotemporal dynamics of …

ST2ST: Self-Supervised Test-time Adaptation for Video Action Recognition

MANI Fahim, M Innat… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
The performance of trained deep neural network (DNN) models relies on the assumption
that the test data has largely the same feature distribution as the training data. In deployed …

Videoeval: Comprehensive benchmark suite for low-cost evaluation of video foundation model

X Li, Z Huang, J Wang, K Li, L Wang - arxiv preprint arxiv:2407.06491, 2024 - arxiv.org
With the growth of high-quality data and advancement in visual pre-training paradigms,
Video Foundation Models (VFMs) have made significant progress recently, demonstrating …