Human action recognition from various data modalities: A review

Z Sun, Q Ke, H Rahmani, M Bennamoun… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Human Action Recognition (HAR) aims to understand human behavior and assign a label to
each action. It has a wide range of applications, and therefore has been attracting increasing …

Transformer for skeleton-based action recognition: A review of recent advances

W **n, R Liu, Y Liu, Y Chen, W Yu, Q Miao - Neurocomputing, 2023 - Elsevier
Skeleton-based action recognition has rapidly become one of the most popular and
essential research topics in computer vision. The task is to analyze the characteristics of …

Masked motion predictors are strong 3d action representation learners

Y Mao, J Deng, W Zhou, Y Fang… - Proceedings of the …, 2023 - openaccess.thecvf.com
In 3D human action recognition, limited supervised data makes it challenging to fully tap into
the modeling potential of powerful networks such as transformers. As a result, researchers …

Audio-visual class-incremental learning

W Pian, S Mo, Y Guo, Y Tian - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
In this paper, we introduce audio-visual class-incremental learning, a class-incremental
learning scenario for audio-visual video recognition. We demonstrate that joint audio-visual …

Visual tuning

BXB Yu, J Chang, H Wang, L Liu, S Wang… - ACM Computing …, 2024 - dl.acm.org
Fine-tuning visual models has been widely shown promising performance on many
downstream visual tasks. With the surprising development of pre-trained visual foundation …

Skeleton cloud colorization for unsupervised 3d action representation learning

S Yang, J Liu, S Lu, MH Er… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Skeleton-based human action recognition has attracted increasing attention in recent years.
However, most of the existing works focus on supervised learning which requiring a large …

Self-regularized prototypical network for few-shot semantic segmentation

H Ding, H Zhang, X Jiang - Pattern Recognition, 2023 - Elsevier
The deep CNNs in image semantic segmentation typically require a large number of
densely-annotated images for training and have difficulties in generalizing to unseen object …

Direcformer: A directed attention in transformer approach to robust action recognition

TD Truong, QH Bui, CN Duong… - Proceedings of the …, 2022 - openaccess.thecvf.com
Human action recognition has recently become one ofthe popular research topics in the
computer vision community. Various 3D-CNN based methods have been presented to tackle …

Recent advances of continual learning in computer vision: An overview

H Qu, H Rahmani, L Xu, B Williams, J Liu - arxiv preprint arxiv …, 2021 - arxiv.org
In contrast to batch learning where all training data is available at once, continual learning
represents a family of methods that accumulate knowledge and learn continuously with data …

Cmd: Self-supervised 3d action representation learning with cross-modal mutual distillation

Y Mao, W Zhou, Z Lu, J Deng, H Li - European Conference on Computer …, 2022 - Springer
In 3D action recognition, there exists rich complementary information between skeleton
modalities. Nevertheless, how to model and utilize this information remains a challenging …