Activity, plan, and goal recognition: A review

FA Van-Horenbeke, A Peer - Frontiers in Robotics and AI, 2021 - frontiersin.org
Recognizing the actions, plans, and goals of a person in an unconstrained environment is a
key feature that future robotic systems will need in order to achieve a natural human …

A comprehensive study of deep video action recognition

Y Zhu, X Li, C Liu, M Zolfaghari, Y **ong, C Wu… - arxiv preprint arxiv …, 2020 - arxiv.org
Video action recognition is one of the representative tasks for video understanding. Over the
last decade, we have witnessed great advancements in video action recognition thanks to …

Elaborative rehearsal for zero-shot action recognition

S Chen, D Huang - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
The growing number of action classes has posed a new challenge for video understanding,
making Zero-Shot Action Recognition (ZSAR) a thriving direction. The ZSAR task aims to …

Progress of human action recognition research in the last ten years: a comprehensive survey

PK Singh, S Kundu, T Adhikary, R Sarkar… - … Methods in Engineering, 2021 - Springer
Abstract Human Action Recognition (HAR) has achieved a remarkable milestone in the field
of computer vision. Apart from its varied applications in human–computer interactions …

Cross-modal representation learning for zero-shot action recognition

CC Lin, K Lin, L Wang, Z Liu… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We present a cross-modal Transformer-based framework, which jointly encodes video data
and text labels for zero-shot action recognition (ZSAR). Our model employs a conceptually …

Hierarchical multimodal transformer to summarize videos

B Zhao, M Gong, X Li - Neurocomputing, 2022 - Elsevier
Although video summarization has achieved tremendous success benefiting from Recurrent
Neural Networks (RNN), RNN-based methods neglect the global dependencies and multi …

Zero-shot learning for imu-based activity recognition using video embeddings

C Tong, J Ge, ND Lane - Proceedings of the ACM on Interactive, Mobile …, 2021 - dl.acm.org
The Activity Recognition Chain generally precludes the challenging scenario of recognizing
new activities that were unseen during training, despite this scenario being a practical and …

Reformulating zero-shot action recognition for multi-label actions

A Kerrigan, K Duarte, Y Rawat… - Advances in Neural …, 2021 - proceedings.neurips.cc
The goal of zero-shot action recognition (ZSAR) is to classify action classes which were not
previously seen during training. Traditionally, this is achieved by training a network to map …

Moma-lrg: Language-refined graphs for multi-object multi-actor activity parsing

Z Luo, Z Durante, L Li, W **e, R Liu… - Advances in …, 2022 - proceedings.neurips.cc
Abstract Video-language models (VLMs), large models pre-trained on numerous but noisy
video-text pairs from the internet, have revolutionized activity recognition through their …

Transformers in action recognition: A review on temporal modeling

E Shabaninia, H Nezamabadi-pour… - arxiv preprint arxiv …, 2022 - arxiv.org
In vision-based action recognition, spatio-temporal features from different modalities are
used for recognizing activities. Temporal modeling is a long challenge of action recognition …