Spatio-temporal attention-based LSTM networks for 3D action recognition and detection

S Song, C Lan, J **ng, W Zeng… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
Human action analytics has attracted a lot of attention for decades in computer vision. It is
important to extract discriminative spatio-temporal features to model the spatial and temporal …

A benchmark dataset and comparison study for multi-modal human action analytics

J Liu, S Song, C Liu, Y Li, Y Hu - ACM Transactions on Multimedia …, 2020 - dl.acm.org
Large-scale benchmarks provide a solid foundation for the development of action analytics.
Most of the previous activity benchmarks focus on analyzing actions in RGB videos. There is …

Unsupervised feature learning of human actions as trajectories in pose embedding manifold

JN Kundu, M Gor, PK Uppala… - 2019 IEEE winter …, 2019 - ieeexplore.ieee.org
An unsupervised human action modeling framework can provide useful pose-sequence
representation, which can be utilized in a variety of pose analysis applications. In this work …

Graph-aware transformer for skeleton-based action recognition

J Zhang, W **e, C Wang, R Tu, Z Tu - The Visual Computer, 2023 - Springer
Recently, graph convolutional networks (GCNs) play a critical role in skeleton-based human
action recognition. However, most GCN-based methods still have two main limitations:(1) …

Skeleton-indexed deep multi-modal feature learning for high performance human action recognition

S Song, C Lan, J **ng, W Zeng… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
This paper presents a new framework for action recognition with multi-modal data. A
skeleton-indexed feature learning procedure is developed to further exploit the detailed …

Nuta: Non-uniform temporal aggregation for action recognition

X Li, C Liu, B Shuai, Y Zhu, H Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
In the world of action recognition research, one primary focus has been on how to construct
and train networks to model the spatial-temporal volume of an input video. These methods …

Learning to recognize human actions from noisy skeleton data via noise adaptation

S Song, J Liu, L Lin, Z Guo - IEEE Transactions on Multimedia, 2021 - ieeexplore.ieee.org
Recent studies have made great progress on skeleton-based action recognition. However,
most of them are developed with relatively clean skeletons without the presence of intensive …

Selective feature compression for efficient activity recognition inference

C Liu, X Li, H Chen, D Modolo… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Most action recognition solutions rely on dense sampling to precisely cover the informative
temporal clip. Extensively searching temporal region is expensive for a real-world …

A fine-to-coarse convolutional neural network for 3D human action recognition

TM Le, N Inoue, K Shinoda - arxiv preprint arxiv:1805.11790, 2018 - arxiv.org
This paper presents a new framework for human action recognition from a 3D skeleton
sequence. Previous studies do not fully utilize the temporal relationships between video …

[PDF][PDF] Action Recognition with the Augmented MoCap Data using Neural Data Translation.

SY Lin, YY Lin - BMVC, 2018 - bmvc2018.org
This study aims at generating reliable augmented training data to learn a robust deep model
for action recognition. The prior knowledge inferred from few training data is not sufficient to …