Deep learning-based action detection in untrimmed videos: A survey
Understanding human behavior and activity facilitates advancement of numerous real-world
applications, and is critical for video analysis. Despite the progress of action recognition …
applications, and is critical for video analysis. Despite the progress of action recognition …
Fine-grained temporal contrastive learning for weakly-supervised temporal action localization
We target at the task of weakly-supervised action localization (WSAL), where only video-
level action labels are available during model training. Despite the recent progress, existing …
level action labels are available during model training. Despite the recent progress, existing …
Dual-evidential learning for weakly-supervised temporal action localization
Weakly-supervised temporal action localization (WS-TAL) aims to localize the action
instances and recognize their categories with only video-level labels. Despite great …
instances and recognize their categories with only video-level labels. Despite great …
Weakly supervised temporal action localization via representative snippet knowledge propagation
Weakly supervised temporal action localization targets at localizing temporal boundaries of
actions and simultaneously identify their categories with only video-level category labels …
actions and simultaneously identify their categories with only video-level category labels …
Exploring denoised cross-video contrast for weakly-supervised temporal action localization
Weakly-supervised temporal action localization aims to localize actions in untrimmed videos
with only video-level labels. Most existing methods address this problem with a" localization …
with only video-level labels. Most existing methods address this problem with a" localization …
Proposal-based multiple instance learning for weakly-supervised temporal action localization
Weakly-supervised temporal action localization aims to localize and recognize actions in
untrimmed videos with only video-level category labels during training. Without instance …
untrimmed videos with only video-level category labels during training. Without instance …
Pivotal: Prior-driven supervision for weakly-supervised temporal action localization
Abstract Weakly-supervised Temporal Action Localization (WTAL) attempts to localize the
actions in untrimmed videos using only video-level supervision. Most recent works approach …
actions in untrimmed videos using only video-level supervision. Most recent works approach …
Vectorized evidential learning for weakly-supervised temporal action localization
With the explosive growth of videos, weakly-supervised temporal action localization (WS-
TAL) task has become a promising research direction in pattern analysis and machine …
TAL) task has become a promising research direction in pattern analysis and machine …
Cross-modal background suppression for audio-visual event localization
Audiovisual Event (AVE) localization requires the model to jointly localize an event by
observing audio and visual information. However, in unconstrained videos, both information …
observing audio and visual information. However, in unconstrained videos, both information …
Boosting weakly-supervised temporal action localization with text information
Due to the lack of temporal annotation, current Weakly-supervised Temporal Action
Localization (WTAL) methods are generally stuck into over-complete or incomplete …
Localization (WTAL) methods are generally stuck into over-complete or incomplete …