Space-time representation of people based on 3D skeletal data: A review

F Han, B Reily, W Hoff, H Zhang - Computer Vision and Image …, 2017 - Elsevier
Spatiotemporal human representation based on 3D visual perception data is a rapidly
growing research area. Representations can be broadly categorized into two groups …

Fine-grained image analysis with deep learning: A survey

XS Wei, YZ Song, O Mac Aodha, J Wu… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer
vision and pattern recognition, and underpins a diverse set of real-world applications. The …

Spatial temporal graph convolutional networks for skeleton-based action recognition

S Yan, Y **ong, D Lin - Proceedings of the AAAI conference on artificial …, 2018 - ojs.aaai.org
Dynamics of human body skeletons convey significant information for human action
recognition. Conventional approaches for modeling skeletons usually rely on hand-crafted …

Distribution consistency based covariance metric networks for few-shot learning

W Li, J Xu, J Huo, L Wang, Y Gao, J Luo - Proceedings of the AAAI …, 2019 - aaai.org
Few-shot learning aims to recognize new concepts from very few examples. However, most
of the existing few-shot learning methods mainly concentrate on the first-order statistic of …

Few-shot action recognition with permutation-invariant attention

H Zhang, L Zhang, X Qi, H Li, PHS Torr… - Computer Vision–ECCV …, 2020 - Springer
Many few-shot learning models focus on recognising images. In contrast, we tackle a
challenging task of few-shot action recognition from videos. We build on a C3D encoder for …

Learning rich part hierarchies with progressive attention networks for fine-grained image recognition

H Zheng, J Fu, ZJ Zha, J Luo… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
We investigate the localization of subtle yet discriminative parts for fine-grained image
recognition. Based on the observation that such parts typically exist within a hierarchical …

Tensor representations for action recognition

P Koniusz, L Wang, A Cherian - IEEE Transactions on Pattern …, 2021 - ieeexplore.ieee.org
Human actions in video sequences are characterized by the complex interplay between
spatial features and their temporal dynamics. In this paper, we propose novel tensor …

SPARE: Self-supervised part erasing for ultra-fine-grained visual categorization

X Yu, Y Zhao, Y Gao - Pattern Recognition, 2022 - Elsevier
This paper presents SPARE, a self-supervised part erasing framework for ultra-fine-grained
visual categorization. The key insight of our model is to learn discriminative representations …

Discriminative multi-instance multitask learning for 3D action recognition

Y Yang, C Deng, S Gao, W Liu, D Tao… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
As the prosperity of low-cost and easy-operating depth cameras, skeleton-based human
action recognition has been extensively studied recently. However, most of the existing …

Benchmark platform for ultra-fine-grained visual categorization beyond human performance

X Yu, Y Zhao, Y Gao, X Yuan… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Deep learning methods have achieved remarkable success in fine-grained visual
categorization. Such successful categorization at sub-ordinate level, eg, different animal or …