Human action recognition from various data modalities: A review
Human Action Recognition (HAR) aims to understand human behavior and assign a label to
each action. It has a wide range of applications, and therefore has been attracting increasing …
each action. It has a wide range of applications, and therefore has been attracting increasing …
Accurate medium-range global weather forecasting with 3D neural networks
Weather forecasting is important for science and society. At present, the most accurate
forecast system is the numerical weather prediction (NWP) method, which represents …
forecast system is the numerical weather prediction (NWP) method, which represents …
Videomae v2: Scaling video masked autoencoders with dual masking
Scale is the primary factor for building a powerful foundation model that could well
generalize to a variety of downstream tasks. However, it is still challenging to train video …
generalize to a variety of downstream tasks. However, it is still challenging to train video …
Tdn: Temporal difference networks for efficient action recognition
Temporal modeling still remains challenging for action recognition in videos. To mitigate this
issue, this paper presents a new video architecture, termed as Temporal Difference Network …
issue, this paper presents a new video architecture, termed as Temporal Difference Network …
A comprehensive study of deep video action recognition
Video action recognition is one of the representative tasks for video understanding. Over the
last decade, we have witnessed great advancements in video action recognition thanks to …
last decade, we have witnessed great advancements in video action recognition thanks to …
Video understanding with large language models: A survey
With the burgeoning growth of online video platforms and the escalating volume of video
content, the demand for proficient video understanding tools has intensified markedly. Given …
content, the demand for proficient video understanding tools has intensified markedly. Given …
A comprehensive survey of rgb-based and skeleton-based human action recognition
C Wang, J Yan - IEEE Access, 2023 - ieeexplore.ieee.org
With the advancement of computer vision, human action recognition (HAR) has shown its
broad research worth and application prospects in a wide range of fields such as intelligent …
broad research worth and application prospects in a wide range of fields such as intelligent …
Delving into the local: Dynamic inconsistency learning for deepfake video detection
The rapid development of facial manipulation techniques has aroused public concerns in
recent years. Existing deepfake video detection approaches attempt to capture the discrim …
recent years. Existing deepfake video detection approaches attempt to capture the discrim …
Video contrastive learning with global context
Contrastive learning has revolutionized the self-supervised image representation learning
field and recently been adapted to the video domain. One of the greatest advantages of …
field and recently been adapted to the video domain. One of the greatest advantages of …
Stmixer: A one-stage sparse action detector
Traditional video action detectors typically adopt the two-stage pipeline, where a person
detector is first employed to yield actor boxes and then 3D RoIAlign is used to extract actor …
detector is first employed to yield actor boxes and then 3D RoIAlign is used to extract actor …