Weakly supervised object localization and detection: A survey
As an emerging and challenging problem in the computer vision community, weakly
supervised object localization and detection plays an important role for develo** new …
supervised object localization and detection plays an important role for develo** new …
Deep learning-based action detection in untrimmed videos: A survey
Understanding human behavior and activity facilitates advancement of numerous real-world
applications, and is critical for video analysis. Despite the progress of action recognition …
applications, and is critical for video analysis. Despite the progress of action recognition …
Temporal action detection with structured segment networks
Detecting actions in untrimmed videos is an important yet challenging task. In this paper, we
present the structured segment network (SSN), a novel framework which models the …
present the structured segment network (SSN), a novel framework which models the …
Mist: Multiple instance self-training framework for video anomaly detection
Weakly supervised video anomaly detection (WS-VAD) is to distinguish anomalies from
normal events based on discriminative representations. Most existing works are limited in …
normal events based on discriminative representations. Most existing works are limited in …
Rescaling egocentric vision: Collection, pipeline and challenges for epic-kitchens-100
This paper introduces the pipeline to extend the largest dataset in egocentric vision, EPIC-
KITCHENS. The effort culminates in EPIC-KITCHENS-100, a collection of 100 hours, 20M …
KITCHENS. The effort culminates in EPIC-KITCHENS-100, a collection of 100 hours, 20M …
TN-ZSTAD: Transferable network for zero-shot temporal activity detection
An integral part of video analysis and surveillance is temporal activity detection, which
means to simultaneously recognize and localize activities in long untrimmed videos …
means to simultaneously recognize and localize activities in long untrimmed videos …
End-to-end temporal action detection with transformer
Temporal action detection (TAD) aims to determine the semantic label and the temporal
interval of every action instance in an untrimmed video. It is a fundamental and challenging …
interval of every action instance in an untrimmed video. It is a fundamental and challenging …
A comprehensive study of deep video action recognition
Video action recognition is one of the representative tasks for video understanding. Over the
last decade, we have witnessed great advancements in video action recognition thanks to …
last decade, we have witnessed great advancements in video action recognition thanks to …
Asm-loc: Action-aware segment modeling for weakly-supervised temporal action localization
Weakly-supervised temporal action localization aims to recognize and localize action
segments in untrimmed videos given only video-level action labels for training. Without the …
segments in untrimmed videos given only video-level action labels for training. Without the …
Fine-grained temporal contrastive learning for weakly-supervised temporal action localization
We target at the task of weakly-supervised action localization (WSAL), where only video-
level action labels are available during model training. Despite the recent progress, existing …
level action labels are available during model training. Despite the recent progress, existing …