Collecting cross-modal presence-absence evidence for weakly-supervised audio-visual event perception

J Gao, M Chen, C Xu - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
With only video-level event labels, this paper targets at the task of weakly-supervised audio-
visual event perception (WS-AVEP), which aims to temporally localize and categorize events …

Pivotal: Prior-driven supervision for weakly-supervised temporal action localization

MN Rizve, G Mittal, Y Yu, M Hall… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Weakly-supervised Temporal Action Localization (WTAL) attempts to localize the
actions in untrimmed videos using only video-level supervision. Most recent works approach …

Vectorized evidential learning for weakly-supervised temporal action localization

J Gao, M Chen, C Xu - IEEE transactions on pattern analysis …, 2023 - ieeexplore.ieee.org
With the explosive growth of videos, weakly-supervised temporal action localization (WS-
TAL) task has become a promising research direction in pattern analysis and machine …

Weakly supervised temporal sentence grounding with uncertainty-guided self-training

Y Huang, L Yang, Y Sato - … of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com
The task of weakly supervised temporal sentence grounding aims at finding the
corresponding temporal moments of a language description in the video, given video …

Vqacl: A novel visual question answering continual learning setting

X Zhang, F Zhang, C Xu - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Research on continual learning has recently led to a variety of work in unimodal community,
however little attention has been paid to multimodal tasks like visual question answering …

A Comprehensive Survey on Evidential Deep Learning and Its Applications

J Gao, M Chen, L **ang, C Xu - arxiv preprint arxiv:2409.04720, 2024 - arxiv.org
Reliable uncertainty estimation has become a crucial requirement for the industrial
deployment of deep learning algorithms, particularly in high-risk applications such as …

Uncertainty-aware Action Decoupling Transformer for Action Anticipation

H Guo, N Agarwal, SY Lo, K Lee… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Human action anticipation aims at predicting what people will do in the future based on past
observations. In this paper we introduce Uncertainty-aware Action Decoupling Transformer …

Distilling vision-language pre-training to collaborate with weakly-supervised temporal action localization

C Ju, K Zheng, J Liu, P Zhao, Y Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Weakly-supervised temporal action localization (WTAL) learns to detect and classify action
instances with only category labels. Most methods widely adopt the off-the-shelf …

DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization

X Tang, J Fan, C Luo, Z Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Weakly-supervised temporal action localization (WTAL) is a practical yet challenging task.
Due to large-scale datasets, most existing methods use a network pretrained in other …

Improving weakly supervised temporal action localization by bridging train-test gap in pseudo labels

J Zhou, L Huang, L Wang, S Liu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
The task of weakly supervised temporal action localization targets at generating temporal
boundaries for actions of interest, meanwhile the action category should also be classified …