Deep learning-based action detection in untrimmed videos: A survey
Understanding human behavior and activity facilitates advancement of numerous real-world
applications, and is critical for video analysis. Despite the progress of action recognition …
ASM-Loc: Action-aware segment modeling for weakly-supervised temporal action localization
Weakly-supervised temporal action localization aims to recognize and localize action
segments in untrimmed videos given only video-level action labels for training. Without the …
Fine-grained temporal contrastive learning for weakly-supervised temporal action localization
We target at the task of weakly-supervised action localization (WSAL), where only video-
level action labels are available during model training. Despite the recent progress, existing …
Dual-evidential learning for weakly-supervised temporal action localization
Weakly-supervised temporal action localization (WS-TAL) aims to localize the action
instances and recognize their categories with only video-level labels. Despite great …
CoLA: Weakly-supervised temporal action localization with snippet contrastive learning
Weakly-supervised temporal action localization (WS-TAL) aims to localize actions in
untrimmed videos with only video-level labels. Most existing models follow the "localization …
Overview of temporal action detection based on deep learning
K Hu, C Shen, T Wang, K Xu, Q Xia, M Xia… - Artificial Intelligence …, 2024 - Springer
Abstract Temporal Action Detection (TAD) aims to accurately capture each action interval in
an untrimmed video and to understand human actions. This paper comprehensively surveys …
Two-stream consensus network for weakly-supervised temporal action localization
Abstract Weakly-supervised Temporal Action Localization (W-TAL) aims to classify and
localize all action instances in an untrimmed video under only video-level supervision …
Weakly supervised temporal action localization via representative snippet knowledge propagation
Weakly supervised temporal action localization targets at localizing temporal boundaries of
actions and simultaneously identify their categories with only video-level category labels …
TSP: Temporally-sensitive pretraining of video encoders for localization tasks
H Alwassel, S Giancola… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Due to the large memory footprint of untrimmed videos, current state-of-the-art video
localization methods operate atop precomputed video clip features. These features are …
A hybrid attention mechanism for weakly-supervised temporal action localization
Weakly supervised temporal action localization is a challenging vision task due to the
absence of ground-truth temporal locations of actions in the training videos. With only video …