Siammask: A framework for fast online object tracking and segmentation

W Hu, Q Wang, L Zhang, L Bertinetto… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
In this article, we introduce SiamMask, a framework to perform both visual object tracking
and video object segmentation, in real-time, with the same simple method. We improve the …

Unsupervised learning of accurate Siamese tracking

Q Shen, L Qiao, J Guo, P Li, X Li, B Li… - Proceedings of the …, 2022 - openaccess.thecvf.com
Unsupervised learning has been popular in various computer vision tasks, including visual
object tracking. However, prior unsupervised tracking approaches rely heavily on spatial …

Learning to track objects from unlabeled videos

J Zheng, C Ma, H Peng, X Yang - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
In this paper, we propose to learn an Unsupervised Single Object Tracker (USOT) from
scratch. We identify that three major challenges, ie, moving object discovery, rich temporal …

Diff-tracker: text-to-image diffusion models are unsupervised trackers

Z Zhang, L Xu, D Peng, H Rahmani, J Liu - European Conference on …, 2024 - Springer
Abstract We introduce Diff-Tracker, a novel approach for the challenging unsupervised
visual tracking task leveraging the pre-trained text-to-image diffusion model. Our main idea …

Progressive unsupervised learning for visual object tracking

Q Wu, J Wan, AB Chan - … of the IEEE/CVF Conference on …, 2021 - openaccess.thecvf.com
In this paper, we propose a progressive unsupervised learning (PUL) framework, which
entirely removes the need for annotated training videos in visual tracking. Specifically, we …

Midas: Generating mmWave radar data from videos for training pervasive and privacy-preserving human sensing tasks

K Deng, D Zhao, Q Han, Z Zhang, S Wang… - Proceedings of the …, 2023 - dl.acm.org
Millimeter wave radar is a promising sensing modality for enabling pervasive and privacy-
preserving human sensing. However, the lack of large-scale radar datasets limits the …

Spatiotemporal dilated convolution with uncertain matching for video-based crowd estimation

YJ Ma, HH Shuai, WH Cheng - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In this paper, we propose a novel SpatioTemporal convolutional Dense Network (STDNet) to
address the video-based crowd counting problem, which contains the decomposition of 3D …

An effective motion-centric paradigm for 3d single object tracking in point clouds

C Zheng, X Yan, H Zhang, B Wang… - IEEE transactions on …, 2023 - ieeexplore.ieee.org
3D single object tracking in LiDAR point clouds (LiDAR SOT) plays a crucial role in
autonomous driving. Current approaches all follow the Siamese paradigm based on …

Semantic-guided reinforced region embedding for generalized zero-shot learning

J Ge, H **e, S Min, Y Zhang - Proceedings of the AAAI Conference on …, 2021 - ojs.aaai.org
Generalized zero-shot Learning (GZSL) aims to recognize images from either seen or
unseen domain, mainly by learning a joint embedding space to associate image features …

Self-supervised discriminative model prediction for visual tracking

D Yuan, G Geng, X Shu, Q Liu, X Chang, Z He… - Neural Computing and …, 2024 - Springer
The discriminative model prediction (DiMP) object tracking model is an excellent end-to-end
tracking framework and have achieved the best results of its time. However, there are two …