Human action recognition from various data modalities: A review

Z Sun, Q Ke, H Rahmani, M Bennamoun… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Human Action Recognition (HAR) aims to understand human behavior and assign a label to
each action. It has a wide range of applications, and therefore has been attracting increasing …

Accurate medium-range global weather forecasting with 3D neural networks

K Bi, L **e, H Zhang, X Chen, X Gu, Q Tian - Nature, 2023 - nature.com
Weather forecasting is important for science and society. At present, the most accurate
forecast system is the numerical weather prediction (NWP) method, which represents …

Videomae v2: Scaling video masked autoencoders with dual masking

L Wang, B Huang, Z Zhao, Z Tong… - Proceedings of the …, 2023 - openaccess.thecvf.com
Scale is the primary factor for building a powerful foundation model that could well
generalize to a variety of downstream tasks. However, it is still challenging to train video …

Tdn: Temporal difference networks for efficient action recognition

L Wang, Z Tong, B Ji, G Wu - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Temporal modeling still remains challenging for action recognition in videos. To mitigate this
issue, this paper presents a new video architecture, termed as Temporal Difference Network …

A comprehensive study of deep video action recognition

Y Zhu, X Li, C Liu, M Zolfaghari, Y **ong, C Wu… - arxiv preprint arxiv …, 2020 - arxiv.org
Video action recognition is one of the representative tasks for video understanding. Over the
last decade, we have witnessed great advancements in video action recognition thanks to …

Video understanding with large language models: A survey

Y Tang, J Bi, S Xu, L Song, S Liang, T Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
With the burgeoning growth of online video platforms and the escalating volume of video
content, the demand for proficient video understanding tools has intensified markedly. Given …

A comprehensive survey of rgb-based and skeleton-based human action recognition

C Wang, J Yan - IEEE Access, 2023 - ieeexplore.ieee.org
With the advancement of computer vision, human action recognition (HAR) has shown its
broad research worth and application prospects in a wide range of fields such as intelligent …

Delving into the local: Dynamic inconsistency learning for deepfake video detection

Z Gu, Y Chen, T Yao, S Ding, J Li, L Ma - Proceedings of the AAAI …, 2022 - ojs.aaai.org
The rapid development of facial manipulation techniques has aroused public concerns in
recent years. Existing deepfake video detection approaches attempt to capture the discrim …

Video contrastive learning with global context

H Kuang, Y Zhu, Z Zhang, X Li… - Proceedings of the …, 2021 - openaccess.thecvf.com
Contrastive learning has revolutionized the self-supervised image representation learning
field and recently been adapted to the video domain. One of the greatest advantages of …

Stmixer: A one-stage sparse action detector

T Wu, M Cao, Z Gao, G Wu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Traditional video action detectors typically adopt the two-stage pipeline, where a person
detector is first employed to yield actor boxes and then 3D RoIAlign is used to extract actor …