Human action recognition from various data modalities: A review

Z Sun, Q Ke, H Rahmani, M Bennamoun… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Human Action Recognition (HAR) aims to understand human behavior and assign a label to
each action. It has a wide range of applications, and therefore has been attracting increasing …

Video pivoting unsupervised multi-modal machine translation

M Li, PY Huang, X Chang, J Hu, Y Yang… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
The main challenge in the field of unsupervised machine translation (UMT) is to associate
source-target sentences in the latent space. As people who speak different languages share …

Mask propagation for efficient video semantic segmentation

Y Weng, M Han, H He, M Li, L Yao… - Advances in …, 2024 - proceedings.neurips.cc
Abstract Video Semantic Segmentation (VSS) involves assigning a semantic label to each
pixel in a video sequence. Prior work in this field has demonstrated promising results by …

Spartan: Self-supervised spatiotemporal transformers approach to group activity recognition

NVS Chappa, P Nguyen, AH Nelson… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we propose a new, simple, and effective Self-supervised Spatio-temporal
Transformers (SPARTAN) approach to Group Activity Recognition (GAR) using unlabeled …

An efficient spatio-temporal pyramid transformer for action detection

Y Weng, Z Pan, M Han, X Chang, B Zhuang - European Conference on …, 2022 - Springer
The task of action detection aims at deducing both the action category and localization of the
start and end moment for each action instance in a long, untrimmed video. While vision …

Interaction-aware joint attention estimation using people attributes

C Nakatani, H Kawashima… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
This paper proposes joint attention estimation in a single image. Different from related work
in which only the gaze-related attributes of people are independently employed,(I) their …

A survey of deep learning in sports applications: Perception, comprehension, and decision

Z Zhao, W Chai, S Hao, W Hu, G Wang, S Cao… - arxiv preprint arxiv …, 2023 - arxiv.org
Deep learning has the potential to revolutionize sports performance, with applications
ranging from perception and comprehension to decision. This paper presents a …

3D-unified spatial-temporal graph for group activity recognition

L Wang, W Feng, C Tian, L Chen, J Pei - Neurocomputing, 2023 - Elsevier
Early group activity recognition is typically conducted in a 2D scenario. This paper proposes
a group activity recognition method in a 3D space. In practical applications, differences in …

Mlst-former: Multi-level spatial-temporal transformer for group activity recognition

X Zhu, Y Zhou, D Wang, W Ouyang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Group activity recognition, which aims to simultaneously understand individual action and
group activity in video clips, plays a fundamental role in computer vision and video analysis …

Perceiving local relative motion and global correlations for weakly supervised group activity recognition

Z Du, X Wang, Q Wang - Image and Vision Computing, 2023 - Elsevier
This paper presents a weakly supervised approach for group activity recognition by
exploiting the local relative motion and global correlations among entities. Most existing …