Deep learning-based action detection in untrimmed videos: A survey
Understanding human behavior and activity facilitates advancement of numerous real-world
applications, and is critical for video analysis. Despite the progress of action recognition …
applications, and is critical for video analysis. Despite the progress of action recognition …
RGB-D data-based action recognition: a review
Classification of human actions is an ongoing research problem in computer vision. This
review is aimed to scope current literature on data fusion and action recognition techniques …
review is aimed to scope current literature on data fusion and action recognition techniques …
Actionformer: Localizing moments of actions with transformers
Self-attention based Transformer models have demonstrated impressive results for image
classification and object detection, and more recently for video understanding. Inspired by …
classification and object detection, and more recently for video understanding. Inspired by …
End-to-end temporal action detection with transformer
Temporal action detection (TAD) aims to determine the semantic label and the temporal
interval of every action instance in an untrimmed video. It is a fundamental and challenging …
interval of every action instance in an untrimmed video. It is a fundamental and challenging …
G-tad: Sub-graph localization for temporal action detection
Temporal action detection is a fundamental yet challenging task in video understanding.
Video context is a critical cue to effectively detect actions, but current works mainly focus on …
Video context is a critical cue to effectively detect actions, but current works mainly focus on …
Relaxed transformer decoders for direct action proposal generation
Temporal action proposal generation is an important and challenging task in video
understanding, which aims at detecting all temporal segments containing action instances of …
understanding, which aims at detecting all temporal segments containing action instances of …
Human action recognition and prediction: A survey
Derived from rapid advances in computer vision and machine learning, video analysis tasks
have been moving from inferring the present state to predicting the future state. Vision-based …
have been moving from inferring the present state to predicting the future state. Vision-based …
TallFormer: Temporal Action Localization with a Long-Memory Transformer
Most modern approaches in temporal action localization divide this problem into two parts:(i)
short-term feature extraction and (ii) long-range temporal boundary localization. Due to the …
short-term feature extraction and (ii) long-range temporal boundary localization. Due to the …
Boundary content graph neural network for temporal action proposal generation
Y Bai, Y Wang, Y Tong, Y Yang, Q Liu, J Liu - Computer Vision–ECCV …, 2020 - Springer
Temporal action proposal generation plays an important role in video action understanding,
which requires localizing high-quality action content precisely. However, generating …
which requires localizing high-quality action content precisely. However, generating …
Enriching local and global contexts for temporal action localization
Effectively tackling the problem of temporal action localization (TAL) necessitates a visual
representation that jointly pursues two confounding goals, ie, fine-grained discrimination for …
representation that jointly pursues two confounding goals, ie, fine-grained discrimination for …