A comprehensive survey of vision-based human action recognition methods

HB Zhang, YX Zhang, B Zhong, Q Lei, L Yang, JX Du… - Sensors, 2019 - mdpi.com
Although widely used in many applications, accurate and efficient human action recognition
remains a challenging area of research in the field of computer vision. Most recent surveys …

Human pose estimation and its application to action recognition: A survey

L Song, G Yu, J Yuan, Z Liu - Journal of Visual Communication and Image …, 2021 - Elsevier
Human pose estimation aims at predicting the poses of human body parts in images or
videos. Since pose motions are often driven by some specific human actions, knowing the …

Actionlet-dependent contrastive learning for unsupervised skeleton-based action recognition

L Lin, J Zhang, J Liu - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
The self-supervised pretraining paradigm has achieved great success in skeleton-based
action recognition. However, these methods treat the motion and static parts equally, and …

Skeletonmae: graph-based masked autoencoder for skeleton sequence pre-training

H Yan, Y Liu, Y Wei, Z Li, G Li… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Skeleton sequence representation learning has shown great advantages for action
recognition due to its promising ability to model human joints and topology. However, the …

Vision-based human activity recognition: a survey

DR Beddiar, B Nini, M Sabokrou, A Hadid - Multimedia Tools and …, 2020 - Springer
Human activity recognition (HAR) systems attempt to automatically identify and analyze
human activities using acquired information from various types of sensors. Although several …

Temporal decoupling graph convolutional network for skeleton-based gesture recognition

J Liu, X Wang, C Wang, Y Gao… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Skeleton-based gesture recognition methods have achieved high success using Graph
Convolutional Network (GCN), which commonly uses an adjacency matrix to model the …

3d human action representation learning via cross-view consistency pursuit

L Li, M Wang, B Ni, H Wang, J Yang… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this work, we propose a Cross-view Contrastive Learning framework for unsupervised 3D
skeleton-based action representation (CrosSCLR), by leveraging multi-view complementary …

Contrastive learning from extremely augmented skeleton sequences for self-supervised action recognition

T Guo, H Liu, Z Chen, M Liu, T Wang… - Proceedings of the AAAI …, 2022 - ojs.aaai.org
In recent years, self-supervised representation learning for skeleton-based action
recognition has been developed with the advance of contrastive learning methods. The …

Human action recognition and prediction: A survey

Y Kong, Y Fu - International Journal of Computer Vision, 2022 - Springer
Derived from rapid advances in computer vision and machine learning, video analysis tasks
have been moving from inferring the present state to predicting the future state. Vision-based …

Mmnet: A model-based multimodal network for human action recognition in rgb-d videos

XB Bruce, Y Liu, X Zhang, S Zhong… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Human action recognition (HAR) in RGB-D videos has been widely investigated since the
release of affordable depth sensors. Currently, unimodal approaches (eg, skeleton-based …