- Academic Search

L Jiao, R Zhang, F Liu, S Yang, B Hou… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Video object detection, a basic task in the computer vision field, is rapidly evolving and
widely used. In recent years, deep learning methods have rapidly become widespread in the …

保存引用被引用次数：227 相关文章所有 3 个版本

[Free GPT-4]

[PDF] mdpi.com

A review of video object detection: Datasets, metrics and methods

H Zhu, H Wei, B Li, X Yuan, N Kehtarnavaz - Applied Sciences, 2020 - mdpi.com

Although there are well established object detection methods based on static images, their
application to video data on a frame by frame basis faces two shortcomings:(i) lack of …

保存引用被引用次数：145 相关文章所有 10 个版本网页快照

[Free GPT-4]

[PDF] arxiv.org

Bevdet4d: Exploit temporal cues in multi-camera 3d object detection

J Huang, G Huang - arxiv preprint arxiv:2203.17054, 2022 - arxiv.org

Single frame data contains finite information which limits the performance of the existing
vision-based multi-camera 3D object detection paradigms. For fundamentally pushing the …

保存引用被引用次数：350 相关文章所有 2 个版本 HTML 版

[Free GPT-4]

[PDF] thecvf.com

Transflow: Transformer as flow learner

Y Lu, Q Wang, S Ma, T Geng… - Proceedings of the …, 2023 - openaccess.thecvf.com

Optical flow is an indispensable building block for various important computer vision tasks,
including motion estimation, object tracking, and disparity measurement. In this work, we …

保存引用被引用次数：88 相关文章所有 6 个版本 HTML 版

[Free GPT-4]

[PDF] thecvf.com

Tf-blender: Temporal feature blender for video object detection

Y Cui, L Yan, Z Cao, D Liu - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

Video objection detection is a challenging task because isolated video frames may
encounter appearance deterioration, which introduces great confusion for detection. One of …

保存引用被引用次数：193 相关文章所有 6 个版本 HTML 版

[Free GPT-4]

[PDF] thecvf.com

Ts-cam: Token semantic coupled attention map for weakly supervised object localization

W Gao, F Wan, X Pan, Z Peng, Q Tian… - Proceedings of the …, 2021 - openaccess.thecvf.com

Weakly supervised object localization (WSOL) is a challenging problem when given image
category labels but requires to learn object localization models. Optimizing a convolutional …

保存引用被引用次数：237 相关文章所有 10 个版本 HTML 版

[Free GPT-4]

[PDF] arxiv.org

Disentangled non-local neural networks

M Yin, Z Yao, Y Cao, X Li, Z Zhang, S Lin… - Computer Vision–ECCV …, 2020 - Springer

The non-local block is a popular module for strengthening the context modeling ability of a
regular convolutional neural network. This paper first studies the non-local block in depth …

保存引用被引用次数：390 相关文章所有 7 个版本

[Free GPT-4]

[PDF] thecvf.com

Memory enhanced global-local aggregation for video object detection

Y Chen, Y Cao, H Hu, L Wang - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com

How do humans recognize an object in a piece of video? Due to the deteriorated quality of
single frame, it may be hard for people to identify an occluded object in this frame by just …

保存引用被引用次数：366 相关文章所有 8 个版本 HTML 版

[Free GPT-4]

[PDF] thecvf.com

Tube-Link: A flexible cross tube framework for universal video segmentation

X Li, H Yuan, W Zhang, G Cheng… - Proceedings of the …, 2023 - openaccess.thecvf.com

Video segmentation aims to segment and track every pixel in diverse scenarios accurately.
In this paper, we present Tube-Link, a versatile framework that addresses multiple core tasks …

保存引用被引用次数：49 相关文章所有 5 个版本 HTML 版

[Free GPT-4]

[PDF] arxiv.org

TransVOD: end-to-end video object detection with spatial-temporal transformers

Q Zhou, X Li, L He, Y Yang, G Cheng… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org

Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the
need for many hand-designed components in object detection while demonstrating good …

保存引用被引用次数：149 相关文章所有 8 个版本

创建快讯

引用

高级搜索

已保存到“我的图书馆”

Relation distillation networks for video object detection

New generation deep learning for video object detection: A survey

A review of video object detection: Datasets, metrics and methods

Bevdet4d: Exploit temporal cues in multi-camera 3d object detection

Transflow: Transformer as flow learner

Tf-blender: Temporal feature blender for video object detection

Ts-cam: Token semantic coupled attention map for weakly supervised object localization

Disentangled non-local neural networks

Memory enhanced global-local aggregation for video object detection

Tube-Link: A flexible cross tube framework for universal video segmentation

TransVOD: end-to-end video object detection with spatial-temporal transformers