Interact before align: Leveraging cross-modal knowledge for domain adaptive action recognition

L Yang, Y Huang, Y Sugano… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Unsupervised domain adaptive video action recognition aims to recognize actions of a
target domain using a model trained with only out-of-domain (source) annotations. The …

Dual alignment unsupervised domain adaptation for video-text retrieval

X Hao, W Zhang, D Wu, F Zhu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Video-text retrieval is an emerging stream in both computer vision and natural language
processing communities, which aims to find relevant videos given text queries. In this paper …

Domain generalization through audio-visual relative norm alignment in first person action recognition

M Planamente, C Plizzari, E Alberti… - Proceedings of the …, 2022 - openaccess.thecvf.com
First person action recognition is becoming an increasingly researched area thanks to the
rising popularity of wearable cameras. This is bringing to light cross-domain issues that are …

Relative norm alignment for tackling domain shift in deep multi-modal classification

M Planamente, C Plizzari, SA Peirone… - International Journal of …, 2024 - Springer
Multi-modal learning has gained significant attention due to its ability to enhance machine
learning algorithms. However, it brings challenges related to modality heterogeneity and …

Adamsformer for spatial action localization in the future

H Chi, K Lee, N Agarwal, Y Xu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Predicting future action locations is vital for applications like human-robot collaboration.
While some computer vision tasks have made progress in predicting human actions …

Exploiting instance-based mixed sampling via auxiliary source domain supervision for domain-adaptive action detection

Y Lu, G Singh, S Saha… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We propose a novel domain adaptive action detection approach and a new adaptation
protocol that leverages the recent advancements in image-level unsupervised domain …

Domain adaptation in multi-view embedding for cross-modal video retrieval

J Munro, M Wray, D Larlus, G Csurka… - arxiv preprint arxiv …, 2021 - arxiv.org
Given a gallery of uncaptioned video sequences, this paper considers the task of retrieving
videos based on their relevance to an unseen text query. To compensate for the lack of …

Toward human-robot cooperation: Unsupervised domain adaptation for egocentric action recognition

M Planamente, G Goletto, G Trivigno, G Averta… - … Workshop on Human …, 2022 - Springer
With the advent of collaborative manipulators, the community is pushing the limits of human-
robot interaction with novel control, planning, and task allocation strategies. For a purposeful …

About Time: Advances, Challenges, and Outlooks of Action Understanding

A Stergiou, R Poppe - arxiv preprint arxiv:2411.15106, 2024 - arxiv.org
We have witnessed impressive advances in video action understanding. Increased dataset
sizes, variability, and computation availability have enabled leaps in performance and task …

Object-based (yet Class-agnostic) Video Domain Adaptation

D Niu, A Bar, R Herzig, T Darrell… - arxiv preprint arxiv …, 2023 - arxiv.org
Existing video-based action recognition systems typically require dense annotation and
struggle in environments when there is significant distribution shift relative to the training …