Closed-loop matters: Dual regression networks for single image super-resolution

Y Guo, J Chen, J Wang, Q Chen… - Proceedings of the …, 2020 - openaccess.thecvf.com
Deep neural networks have exhibited promising performance in image super-resolution
(SR) by learning a nonlinear map** function from low-resolution (LR) images to high …

Relation-aware global attention for person re-identification

Z Zhang, C Lan, W Zeng, X **… - Proceedings of the ieee …, 2020 - openaccess.thecvf.com
For person re-identification (re-id), attention mechanisms have become attractive as they
aim at strengthening discriminative features and suppressing irrelevant ones, which …

Dense regression network for video grounding

R Zeng, H Xu, W Huang, P Chen… - Proceedings of the …, 2020 - openaccess.thecvf.com
We address the problem of video grounding from natural language queries. The key
challenge in this task is that one training video might only contain a few annotated …

A survey on video action recognition in sports: Datasets, methods and applications

F Wu, Q Wang, J Bian, N Ding, F Lu… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
To understand human behaviors, action recognition based on videos is a common
approach. Compared with image-based action recognition, videos provide much more …

Cat: Localization and identification cascade detection transformer for open-world object detection

S Ma, Y Wang, Y Wei, J Fan, TH Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Open-world object detection (OWOD), as a more general and challenging goal, requires the
model trained from data on known objects to detect both known and unknown objects and …

Temporal action localization in the deep learning era: A survey

B Wang, Y Zhao, L Yang, T Long… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The temporal action localization research aims to discover action instances from untrimmed
videos, representing a fundamental step in the field of intelligent video understanding. With …

Rspnet: Relative speed perception for unsupervised video representation learning

P Chen, D Huang, D He, X Long, R Zeng… - Proceedings of the …, 2021 - ojs.aaai.org
We study unsupervised video representation learning that seeks to learn both motion and
appearance features from unlabeled video only, which can be reused for downstream tasks …

Colar: Effective and efficient online action detection by consulting exemplars

L Yang, J Han, D Zhang - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Online action detection has attracted increasing research interests in recent years. Current
works model historical dependencies and anticipate the future to perceive the action …

Class semantics-based attention for action detection

D Sridhar, N Quader, S Muralidharan… - Proceedings of the …, 2021 - openaccess.thecvf.com
Action localization networks are often structured as a feature encoder sub-network and a
localization sub-network, where the feature encoder learns to transform an input video to …

Masked motion encoding for self-supervised video representation learning

X Sun, P Chen, L Chen, C Li, TH Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
How to learn discriminative video representation from unlabeled videos is challenging but
crucial for video analysis. The latest attempts seek to learn a representation model by …