Two-stream consensus network for weakly-supervised temporal action localization
Abstract Weakly-supervised Temporal Action Localization (W-TAL) aims to classify and
localize all action instances in an untrimmed video under only video-level supervision …
localize all action instances in an untrimmed video under only video-level supervision …
Generalized weakly supervised object localization
D Zhang, G Guo, W Zeng, L Li… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
With the goal of learning to localize specific object semantics using the low-cost image-level
annotation, weakly supervised object localization (WSOL) has been receiving increasing …
annotation, weakly supervised object localization (WSOL) has been receiving increasing …
Soar: Scene-debiasing open-set action recognition
Deep models have the risk of utilizing spurious clues to make predictions, eg, recognizing
actions via classifying the background scene. This problem severely degrades the open-set …
actions via classifying the background scene. This problem severely degrades the open-set …
Exploring optical-flow-guided motion and detection-based appearance for temporal sentence grounding
Temporal sentence grounding aims to localize a target segment in an untrimmed video
semantically according to a given sentence query. Most previous works focus on learning …
semantically according to a given sentence query. Most previous works focus on learning …
Weakly-supervised temporal action localization: a survey
Abstract Temporal Action Localization (TAL) is an important task of various computer vision
topics such as video understanding, summarization, and analysis. In the real world, the …
topics such as video understanding, summarization, and analysis. In the real world, the …
Action graphs: Weakly-supervised action localization with graph convolution networks
We present a method for weakly-supervised action localization based on graph
convolutions. In order to find and classify video time segments that correspond to relevant …
convolutions. In order to find and classify video time segments that correspond to relevant …
Temporal action localization in the deep learning era: A survey
The temporal action localization research aims to discover action instances from untrimmed
videos, representing a fundamental step in the field of intelligent video understanding. With …
videos, representing a fundamental step in the field of intelligent video understanding. With …
A novel action saliency and context-aware network for weakly-supervised temporal action localization
Y Zhao, H Zhang, Z Gao, W Gao… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Temporal action localization is a challenging task in computer vision, and it tries to find the
start time and the end time of the actions and predict their categories. However, compared to …
start time and the end time of the actions and predict their categories. However, compared to …
Weakly supervised audio-visual violence detection
Violence detection in videos is very promising in practical applications due to the
emergence of massive videos in recent years. Most previous works define violence …
emergence of massive videos in recent years. Most previous works define violence …
Few-shot temporal sentence grounding via memory-guided semantic learning
Temporal sentence grounding (TSG) is an important yet challenging task in video-based
information retrieval. Given an untrimmed video input, it requires the machine to predict the …
information retrieval. Given an untrimmed video input, it requires the machine to predict the …