Space-time representation of people based on 3D skeletal data: A review
Spatiotemporal human representation based on 3D visual perception data is a rapidly
growing research area. Representations can be broadly categorized into two groups …
growing research area. Representations can be broadly categorized into two groups …
Fine-grained image analysis with deep learning: A survey
Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer
vision and pattern recognition, and underpins a diverse set of real-world applications. The …
vision and pattern recognition, and underpins a diverse set of real-world applications. The …
Spatial temporal graph convolutional networks for skeleton-based action recognition
Dynamics of human body skeletons convey significant information for human action
recognition. Conventional approaches for modeling skeletons usually rely on hand-crafted …
recognition. Conventional approaches for modeling skeletons usually rely on hand-crafted …
Distribution consistency based covariance metric networks for few-shot learning
Few-shot learning aims to recognize new concepts from very few examples. However, most
of the existing few-shot learning methods mainly concentrate on the first-order statistic of …
of the existing few-shot learning methods mainly concentrate on the first-order statistic of …
Few-shot action recognition with permutation-invariant attention
Many few-shot learning models focus on recognising images. In contrast, we tackle a
challenging task of few-shot action recognition from videos. We build on a C3D encoder for …
challenging task of few-shot action recognition from videos. We build on a C3D encoder for …
Learning rich part hierarchies with progressive attention networks for fine-grained image recognition
We investigate the localization of subtle yet discriminative parts for fine-grained image
recognition. Based on the observation that such parts typically exist within a hierarchical …
recognition. Based on the observation that such parts typically exist within a hierarchical …
Tensor representations for action recognition
Human actions in video sequences are characterized by the complex interplay between
spatial features and their temporal dynamics. In this paper, we propose novel tensor …
spatial features and their temporal dynamics. In this paper, we propose novel tensor …
SPARE: Self-supervised part erasing for ultra-fine-grained visual categorization
This paper presents SPARE, a self-supervised part erasing framework for ultra-fine-grained
visual categorization. The key insight of our model is to learn discriminative representations …
visual categorization. The key insight of our model is to learn discriminative representations …
Discriminative multi-instance multitask learning for 3D action recognition
As the prosperity of low-cost and easy-operating depth cameras, skeleton-based human
action recognition has been extensively studied recently. However, most of the existing …
action recognition has been extensively studied recently. However, most of the existing …
Benchmark platform for ultra-fine-grained visual categorization beyond human performance
Deep learning methods have achieved remarkable success in fine-grained visual
categorization. Such successful categorization at sub-ordinate level, eg, different animal or …
categorization. Such successful categorization at sub-ordinate level, eg, different animal or …