Transformer for skeleton-based action recognition: A review of recent advances

W **n, R Liu, Y Liu, Y Chen, W Yu, Q Miao - Neurocomputing, 2023 - Elsevier
Skeleton-based action recognition has rapidly become one of the most popular and
essential research topics in computer vision. The task is to analyze the characteristics of …

rppg-mae: Self-supervised pretraining with masked autoencoders for remote physiological measurements

X Liu, Y Zhang, Z Yu, H Lu, H Yue… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Remote photoplethysmography (rPPG) is an important technique for detecting human vital
signs and has received extensive attention. For a long time, researchers have focused …

Decompose more and aggregate better: Two closer looks at frequency representation learning for human motion prediction

X Gao, S Du, Y Wu, Y Yang - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Encouraged by the effectiveness of encoding temporal dynamics within the frequency
domain, recent human motion prediction systems prefer to first convert the motion …

Self-supervised 3D action representation learning with skeleton cloud colorization

S Yang, J Liu, S Lu, EM Hwa, Y Hu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
3D Skeleton-based human action recognition has attracted increasing attention in recent
years. Most of the existing work focuses on supervised learning which requires a large …

Motion guided attention learning for self-supervised 3D human action recognition

Y Yang, G Liu, X Gao - … on Circuits and Systems for Video …, 2022 - ieeexplore.ieee.org
3D human action recognition has received increasing attention due to its potential
application in video surveillance equipment. To guarantee satisfactory performance …

Deep neural networks in video human action recognition: A review

Z Wang, Y Yang, Z Liu, Y Zheng - arxiv preprint arxiv:2305.15692, 2023 - arxiv.org
Currently, video behavior recognition is one of the most foundational tasks of computer
vision. The 2D neural networks of deep learning are built for recognizing pixel-level …

Glimpse and focus: Global and local-scale graph convolution network for skeleton-based action recognition

X Gao, S Du, Y Yang - Neural Networks, 2023 - Elsevier
In the 3D skeleton-based action recognition task, learning rich spatial and temporal motion
patterns from body joints are two foundational yet under-explored problems. In this paper …

Learning heterogeneous spatial–temporal context for skeleton-based action recognition

X Gao, Y Yang, Y Wu, S Du - IEEE Transactions on Neural …, 2023 - ieeexplore.ieee.org
Graph convolution networks (GCNs) have been widely used and achieved fruitful progress
in the skeleton-based action recognition task. In GCNs, node interaction modeling …

Learning representations by contrastive spatio-temporal clustering for skeleton-based action recognition

M Wang, X Li, S Chen, X Zhang, L Ma… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Self-supervised representation learning has proven constructive for skeleton-based action
recognition. For better performance, existing methods mainly focus on 1) multi-modal data …

Guess: Gradually enriching synthesis for text-driven human motion generation

X Gao, Y Yang, Z **e, S Du, Z Sun… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In this article, we propose a novel cascaded diffusion-based generative framework for text-
driven human motion synthesis, which exploits a strategy named GradUally Enriching …