Pivotal: Prior-driven supervision for weakly-supervised temporal action localization

MN Rizve, G Mittal, Y Yu, M Hall… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Weakly-supervised Temporal Action Localization (WTAL) attempts to localize the
actions in untrimmed videos using only video-level supervision. Most recent works approach …

Spact: Self-supervised privacy preservation for action recognition

IR Dave, C Chen, M Shah - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
Visual private information leakage is an emerging key issue for the fast growing applications
of video understanding like activity recognition. Existing approaches for mitigating privacy …

Online Action Detection in Surveillance Scenarios: A Comprehensive Review and Comparative Study of State-of-the-Art Multi-Object Tracking Methods

J Alikhanov, H Kim - IEEE Access, 2023 - ieeexplore.ieee.org
Online action detection in surveillance scenarios presents considerable challenges,
particularly due to the dynamically changing environments and real-time processing …

A survey on deep learning-based spatio-temporal action detection

P Wang, F Zeng, Y Qian - arxiv preprint arxiv:2308.01618, 2023 - arxiv.org
Spatio-temporal action detection (STAD) aims to classify the actions present in a video and
localize them in space and time. It has become a particularly active area of research in …

Timebalance: Temporally-invariant and temporally-distinctive video representations for semi-supervised action recognition

IR Dave, MN Rizve, C Chen… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Abstract Semi-Supervised Learning can be more beneficial for the video domain compared
to images because of its higher annotation cost and dimensionality. Besides, any video …

On occlusions in video action detection: benchmark datasets and training recipes

R Modi, V Vineet, Y Rawat - Advances in Neural …, 2024 - proceedings.neurips.cc
This paper explores the impact of occlusions in video action detection. We facilitatethis study
by introducing five new benchmark datasets namely O-UCF and O-JHMDB consisting of …

Audio-visual glance network for efficient video recognition

MA Nugroho, S Woo, S Lee… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Deep learning has made significant strides in video understanding tasks, but the
computation required to classify lengthy and massive videos using clip-level video …

Transvisdrone: Spatio-temporal transformer for vision-based drone-to-drone detection in aerial videos

T Sangam, IR Dave, W Sultani… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Drone-to-drone detection using visual feed has crucial applications, such as detecting drone
collisions, detecting drone attacks, or coordinating flight with other drones. However, existing …

Spatio-temporal action detection under large motion

G Singh, V Choutas, S Saha, F Yu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Current methods for spatiotemporal action tube detection often extend a bounding box
proposal at a given key-frame into a 3D temporal cuboid and pool features from nearby …

Sync from the sea: retrieving alignable videos from large-scale datasets

IR Dave, FC Heilbron, M Shah, S Jenni - European Conference on …, 2024 - Springer
Temporal video alignment aims to synchronize the key events like object interactions or
action phase transitions in two videos. Such methods could benefit various video editing …