Two-stream consensus network for weakly-supervised temporal action localization

Y Zhai, L Wang, W Tang, Q Zhang, J Yuan… - Computer Vision–ECCV …, 2020 - Springer
Weakly-supervised Temporal Action Localization (W-TAL) aims to classify and
localize all action instances in an untrimmed video under only video-level supervision …

Generalized weakly supervised object localization

D Zhang, G Guo, W Zeng, L Li… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
With the goal of learning to localize specific object semantics using the low-cost image-level
annotation, weakly supervised object localization (WSOL) has been receiving increasing …

SOAR: Scene-debiasing open-set action recognition

Y Zhai, Z Liu, Z Wu, Y Wu, C Zhou… - Proceedings of the …, 2023 - openaccess.thecvf.com
Deep models have the risk of utilizing spurious clues to make predictions, e.g., recognizing
actions via classifying the background scene. This problem severely degrades the open-set …

Exploring optical-flow-guided motion and detection-based appearance for temporal sentence grounding

D Liu, X Fang, W Hu, P Zhou - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Temporal sentence grounding aims to localize a target segment in an untrimmed video
semantically according to a given sentence query. Most previous works focus on learning …

Weakly-supervised temporal action localization: a survey

AR Baraka, MH Mohd Noor - Neural Computing and Applications, 2022 - Springer
Temporal Action Localization (TAL) is an important task of various computer vision
topics such as video understanding, summarization, and analysis. In the real world, the …

Action graphs: Weakly-supervised action localization with graph convolution networks

M Rashid, H Kjellstrom, YJ Lee - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
We present a method for weakly-supervised action localization based on graph
convolutions. In order to find and classify video time segments that correspond to relevant …

Temporal action localization in the deep learning era: A survey

B Wang, Y Zhao, L Yang, T Long… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The temporal action localization research aims to discover action instances from untrimmed
videos, representing a fundamental step in the field of intelligent video understanding. With …

A novel action saliency and context-aware network for weakly-supervised temporal action localization

Y Zhao, H Zhang, Z Gao, W Gao… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Temporal action localization is a challenging task in computer vision, and it tries to find the
start time and the end time of the actions and predict their categories. However, compared to …

Weakly supervised audio-visual violence detection

P Wu, X Liu, J Liu - IEEE Transactions on Multimedia, 2022 - ieeexplore.ieee.org
Violence detection in videos is very promising in practical applications due to the
emergence of massive videos in recent years. Most previous works define violence …

Few-shot temporal sentence grounding via memory-guided semantic learning

D Liu, P Zhou, Z Xu, H Wang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Temporal sentence grounding (TSG) is an important yet challenging task in video-based
information retrieval. Given an untrimmed video input, it requires the machine to predict the …