Collecting cross-modal presence-absence evidence for weakly-supervised audio-visual event perception
With only video-level event labels, this paper targets at the task of weakly-supervised audio-
visual event perception (WS-AVEP), which aims to temporally localize and categorize events …
PivoTAL: Prior-driven supervision for weakly-supervised temporal action localization
Weakly-supervised Temporal Action Localization (WTAL) attempts to localize the
actions in untrimmed videos using only video-level supervision. Most recent works approach …
Vectorized evidential learning for weakly-supervised temporal action localization
With the explosive growth of videos, weakly-supervised temporal action localization (WS-
TAL) task has become a promising research direction in pattern analysis and machine …
Weakly supervised temporal sentence grounding with uncertainty-guided self-training
The task of weakly supervised temporal sentence grounding aims at finding the
corresponding temporal moments of a language description in the video, given video …
VQACL: A novel visual question answering continual learning setting
Research on continual learning has recently led to a variety of work in unimodal community,
however little attention has been paid to multimodal tasks like visual question answering …
A Comprehensive Survey on Evidential Deep Learning and Its Applications
Reliable uncertainty estimation has become a crucial requirement for the industrial
deployment of deep learning algorithms, particularly in high-risk applications such as …
Uncertainty-aware Action Decoupling Transformer for Action Anticipation
Human action anticipation aims at predicting what people will do in the future based on past
observations. In this paper we introduce Uncertainty-aware Action Decoupling Transformer …
Distilling vision-language pre-training to collaborate with weakly-supervised temporal action localization
Weakly-supervised temporal action localization (WTAL) learns to detect and classify action
instances with only category labels. Most methods widely adopt the off-the-shelf …
DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization
Weakly-supervised temporal action localization (WTAL) is a practical yet challenging task.
Due to large-scale datasets, most existing methods use a network pretrained in other …
Improving weakly supervised temporal action localization by bridging train-test gap in pseudo labels
The task of weakly supervised temporal action localization targets at generating temporal
boundaries for actions of interest, meanwhile the action category should also be classified …