Deep learning-based action detection in untrimmed videos: A survey

E Vahdani, Y Tian - IEEE Transactions on Pattern Analysis and …, 2022 - ieeexplore.ieee.org
Understanding human behavior and activity facilitates advancement of numerous real-world
applications, and is critical for video analysis. Despite the progress of action recognition …

Asm-loc: Action-aware segment modeling for weakly-supervised temporal action localization

B He, X Yang, L Kang, Z Cheng… - Proceedings of the …, 2022 - openaccess.thecvf.com
Weakly-supervised temporal action localization aims to recognize and localize action
segments in untrimmed videos given only video-level action labels for training. Without the …

Fine-grained temporal contrastive learning for weakly-supervised temporal action localization

J Gao, M Chen, C Xu - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
We target at the task of weakly-supervised action localization (WSAL), where only video-
level action labels are available during model training. Despite the recent progress, existing …

Dual-evidential learning for weakly-supervised temporal action localization

M Chen, J Gao, S Yang, C Xu - European conference on computer vision, 2022 - Springer
Weakly-supervised temporal action localization (WS-TAL) aims to localize the action
instances and recognize their categories with only video-level labels. Despite great …

Cola: Weakly-supervised temporal action localization with snippet contrastive learning

C Zhang, M Cao, D Yang, J Chen… - Proceedings of the …, 2021 - openaccess.thecvf.com
Weakly-supervised temporal action localization (WS-TAL) aims to localize actions in
untrimmed videos with only video-level labels. Most existing models follow the" localization …

Overview of temporal action detection based on deep learning

K Hu, C Shen, T Wang, K Xu, Q **a, M **a… - Artificial Intelligence …, 2024 - Springer
Abstract Temporal Action Detection (TAD) aims to accurately capture each action interval in
an untrimmed video and to understand human actions. This paper comprehensively surveys …

Two-stream consensus network for weakly-supervised temporal action localization

Y Zhai, L Wang, W Tang, Q Zhang, J Yuan… - Computer Vision–ECCV …, 2020 - Springer
Abstract Weakly-supervised Temporal Action Localization (W-TAL) aims to classify and
localize all action instances in an untrimmed video under only video-level supervision …

Weakly supervised temporal action localization via representative snippet knowledge propagation

L Huang, L Wang, H Li - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Weakly supervised temporal action localization targets at localizing temporal boundaries of
actions and simultaneously identify their categories with only video-level category labels …

Tsp: Temporally-sensitive pretraining of video encoders for localization tasks

H Alwassel, S Giancola… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Due to the large memory footprint of untrimmed videos, current state-of-the-art video
localization methods operate atop precomputed video clip features. These features are …

A hybrid attention mechanism for weakly-supervised temporal action localization

A Islam, C Long, R Radke - Proceedings of the AAAI conference on …, 2021 - ojs.aaai.org
Weakly supervised temporal action localization is a challenging vision task due to the
absence of ground-truth temporal locations of actions in the training videos. With only video …