Activity, plan, and goal recognition: A review
FA Van-Horenbeke, A Peer - Frontiers in Robotics and AI, 2021 - frontiersin.org
Recognizing the actions, plans, and goals of a person in an unconstrained environment is a
key feature that future robotic systems will need in order to achieve a natural human …
A comprehensive study of deep video action recognition
Video action recognition is one of the representative tasks for video understanding. Over the
last decade, we have witnessed great advancements in video action recognition thanks to …
Elaborative rehearsal for zero-shot action recognition
The growing number of action classes has posed a new challenge for video understanding,
making Zero-Shot Action Recognition (ZSAR) a thriving direction. The ZSAR task aims to …
Progress of human action recognition research in the last ten years: a comprehensive survey
Human Action Recognition (HAR) has achieved a remarkable milestone in the field
of computer vision. Apart from its varied applications in human–computer interactions …
Cross-modal representation learning for zero-shot action recognition
We present a cross-modal Transformer-based framework, which jointly encodes video data
and text labels for zero-shot action recognition (ZSAR). Our model employs a conceptually …
Hierarchical multimodal transformer to summarize videos
Although video summarization has achieved tremendous success benefiting from Recurrent
Neural Networks (RNN), RNN-based methods neglect the global dependencies and multi …
Zero-shot learning for IMU-based activity recognition using video embeddings
The Activity Recognition Chain generally precludes the challenging scenario of recognizing
new activities that were unseen during training, despite this scenario being a practical and …
Reformulating zero-shot action recognition for multi-label actions
The goal of zero-shot action recognition (ZSAR) is to classify action classes which were not
previously seen during training. Traditionally, this is achieved by training a network to map …
MOMA-LRG: Language-refined graphs for multi-object multi-actor activity parsing
Video-language models (VLMs), large models pre-trained on numerous but noisy
video-text pairs from the internet, have revolutionized activity recognition through their …
Transformers in action recognition: A review on temporal modeling
E Shabaninia, H Nezamabadi-pour… - arXiv preprint arXiv …, 2022 - arxiv.org
In vision-based action recognition, spatio-temporal features from different modalities are
used for recognizing activities. Temporal modeling is a long challenge of action recognition …