New generation deep learning for video object detection: A survey

L Jiao, R Zhang, F Liu, S Yang, B Hou… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Video object detection, a basic task in the computer vision field, is rapidly evolving and
widely used. In recent years, deep learning methods have rapidly become widespread in the …

A review of video object detection: Datasets, metrics and methods

H Zhu, H Wei, B Li, X Yuan, N Kehtarnavaz - Applied Sciences, 2020 - mdpi.com
Although there are well established object detection methods based on static images, their
application to video data on a frame by frame basis faces two shortcomings:(i) lack of …

Basicvsr++: Improving video super-resolution with enhanced propagation and alignment

KCK Chan, S Zhou, X Xu… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
A recurrent structure is a popular framework choice for the task of video super-resolution.
The state-of-the-art method BasicVSR adopts bidirectional propagation with feature …

Basicvsr: The search for essential components in video super-resolution and beyond

KCK Chan, X Wang, K Yu, C Dong… - Proceedings of the …, 2021 - openaccess.thecvf.com
Video super-resolution (VSR) approaches tend to have more components than the image
counterparts as they need to exploit the additional temporal dimension. Complex designs …

Memory enhanced global-local aggregation for video object detection

Y Chen, Y Cao, H Hu, L Wang - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
How do humans recognize an object in a piece of video? Due to the deteriorated quality of
single frame, it may be hard for people to identify an occluded object in this frame by just …

Tube-Link: A flexible cross tube framework for universal video segmentation

X Li, H Yuan, W Zhang, G Cheng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Video segmentation aims to segment and track every pixel in diverse scenarios accurately.
In this paper, we present Tube-Link, a versatile framework that addresses multiple core tasks …

TransVOD: end-to-end video object detection with spatial-temporal transformers

Q Zhou, X Li, L He, Y Yang, G Cheng… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the
need for many hand-designed components in object detection while demonstrating good …

Sequence level semantics aggregation for video object detection

H Wu, Y Chen, N Wang… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Video objection detection (VID) has been a rising research direction in recent years. A
central issue of VID is the appearance degradation of video frames caused by fast motion …

Multi-object detection and tracking (MODT) machine learning model for real-time video surveillance systems

M Elhoseny - Circuits, Systems, and Signal Processing, 2020 - Springer
Recently, video surveillance has garnered considerable attention in various real-time
applications. Due to advances in the field of machine learning, numerous techniques have …

Transferable adversarial attacks for image and video object detection

X Wei, S Liang, N Chen, X Cao - arxiv preprint arxiv:1811.12641, 2018 - arxiv.org
Adversarial examples have been demonstrated to threaten many computer vision tasks
including object detection. However, the existing attacking methods for object detection have …