Modeling the relative visual tempo for self-supervised skeleton-based action recognition

Y Zhu, H Han, Z Yu, G Liu - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Visual tempo characterizes the dynamics and the temporal evolution, which helps describe
actions. Recent approaches directly perform visual tempo prediction on skeleton sequences …

SelfGCN: Graph convolution network with self-attention for skeleton-based action recognition

Z Wu, P Sun, X Chen, K Tang, T Xu… - … on Image Processing, 2024 - ieeexplore.ieee.org
Graph Convolutional Networks (GCNs) are widely used for skeleton-based action
recognition and achieved remarkable performance. Due to the locality of graph convolution …

Focusing fine-grained action by self-attention-enhanced graph neural networks with contrastive learning

P Geng, X Lu, C Hu, H Liu, L Lyu - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the aid of graph convolution neural network and transformer model, human action
recognition has achieved significant performance based on skeleton data. However, the …

Feature reconstruction graph convolutional network for skeleton-based action recognition

J Huang, Z Wang, J Peng, F Huang - Engineering Applications of Artificial …, 2023 - Elsevier
Skeleton-based action recognition is an important task in computer vision. Recently, graph
convolutional networks (GCNs) have been successfully applied to this task and achieved …

Exploring incomplete decoupling modeling with window and cross-window mechanism for skeleton-based action recognition

S Li, X **ang, J Fang, J Zhang, S Cheng… - Knowledge-Based …, 2023 - Elsevier
Skeleton-based action recognition relies on capturing connections between joints to extract
action-specific features. Current approaches utilizing temporal convolution for inter-frame …

Top-heavy CapsNets based on spatiotemporal non-local for action recognition

MH Ha - Journal of Computing Theories and Applications, 2024 - dl.futuretechsci.org
To effectively comprehend human actions, we have developed a Deep Neural Network
(DNN) that utilizes inner spatiotemporal non-locality to capture meaningful semantic context …

Human pose-based estimation, tracking and action recognition with deep learning: A survey

L Zhou, X Meng, Z Liu, M Wu, Z Gao… - arxiv preprint arxiv …, 2023 - arxiv.org
Human pose analysis has garnered significant attention within both the research community
and practical applications, owing to its expanding array of uses, including gaming, video …

Behavioral Recognition of Skeletal Data Based on Targeted Dual Fusion Strategy

X Yun, C Xu, K Riou, K Dong, Y Sun, S Li… - Proceedings of the …, 2024 - ojs.aaai.org
The deployment of multi-stream fusion strategy on behavioral recognition from skeletal data
can extract complementary features from different information streams and improve the …

Modeling the skeleton-language uncertainty for 3D action recognition

M Wang, X Zhang, S Chen, X Li, Y Zhang - Neurocomputing, 2024 - Elsevier
Human 3D skeleton-based action recognition has received increasing interest in recent
years. Inspired by the excellent ability of the multi-modal model, some pioneer attempts to …

A direction-decoupled non-local attention network for single image super-resolution

Z Song, B Zhong, J Ji, KK Ma - IEEE Signal Processing Letters, 2022 - ieeexplore.ieee.org
The non-local attention mechanism has often been exploited in deep learning to capture
long-range dependencies (LRDs) from the same image for enhancing the performance of …