DDG-Net: Discriminability-driven graph network for weakly-supervised temporal action localization

X Tang, J Fan, C Luo, Z Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Weakly-supervised temporal action localization (WTAL) is a practical yet challenging task.
Due to large-scale datasets, most existing methods use a network pretrained in other …

Weakly-supervised temporal action localization with multi-modal plateau Transformers

X Hu, K Li, D Patel, E Kruus… - Proceedings of the …, 2024 - openaccess.thecvf.com
Weakly Supervised Temporal Action Localization (WSTAL) aims to jointly localize
and classify action segments in untrimmed videos with only video level annotations. To …

Learning proposal-aware re-ranking for weakly-supervised temporal action localization

Y Hu, J Fu, M Chen, J Gao, J Dong… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Weakly-supervised temporal action localization (WTAL) aims to localize and classify action
instances in untrimmed videos with only video-level labels available. Despite the …

HR-Pro: Point-supervised temporal action localization via hierarchical reliability propagation

H Zhang, X Wang, X Xu, Z Qing, C Gao… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Point-supervised Temporal Action Localization (PSTAL) is an emerging research direction
for label-efficient learning. However, current methods mainly focus on optimizing the network …

Uncertainty-aware dual-evidential learning for weakly-supervised temporal action localization

M Chen, J Gao, C Xu - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Weakly-supervised temporal action localization (WTAL) aims to localize the action instances
and recognize their categories with only video-level labels. Despite great progress, existing …

Boosting positive segments for weakly-supervised audio-visual video parsing

KK Rachavarapu - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
In this paper, we address the problem of weakly supervised Audio-Visual Video Parsing
(AVVP), where the goal is to temporally localize events that are audible or visible and …

A snippets relation and hard-snippets mask network for weakly-supervised temporal action localization

Y Zhao, H Zhang, Z Gao, W Guan… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Weakly-supervised temporal action localization (WTAL) is the problem of learning an action
localization model with only video-level labels available. In recent years, many WTAL …

Revisiting Unsupervised Temporal Action Localization: The Primacy of High-Quality Actionness and Pseudolabels

H Jiang, H Tang, M Yan, J Zhang, M Xu, Y Hu… - Proceedings of the …, 2024 - dl.acm.org
Recently, temporal action localization (TAL) methods, especially the weakly-supervised and
unsupervised ones, have become a hot research topic. Existing unsupervised methods …

CGCN: context graph convolutional network for few-shot temporal action localization

S Zhang, H Wang, L Wang, X Han, Q Tian - Information Processing & …, 2025 - Elsevier
Localizing human actions in videos has attracted extensive attention from industry and
academia. Few-Shot Temporal Action Localization (FS-TAL) aims to detect human actions in …

Weakly supervised temporal action localization with actionness-guided false positive suppression

Z Li, Z Wang, Q Liu - Neural Networks, 2024 - Elsevier
Weakly supervised temporal action localization aims to locate the temporal boundaries of
action instances in untrimmed videos using video-level labels and assign them the …