Satellite video single object tracking: A systematic review and an oriented object tracking benchmark

Y Chen, Y Tang, Y Xiao, Q Yuan, Y Zhang, F Liu… - ISPRS Journal of …, 2024 - Elsevier
Single object tracking (SOT) in satellite video (SV) enables the continuous acquisition of
position and range information of an arbitrary object, showing promising value in remote …

SED: A simple encoder-decoder for open-vocabulary semantic segmentation

B Xie, J Cao, J Xie, FS Khan… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Open-vocabulary semantic segmentation strives to distinguish pixels into different semantic
groups from an open set of categories. Most existing methods explore utilizing pre-trained …

Tracking meets LoRA: Faster training, larger model, stronger performance

L Lin, H Fan, Z Zhang, Y Wang, Y Xu, H Ling - European Conference on …, 2024 - Springer
Motivated by the Parameter-Efficient Fine-Tuning (PEFT) in large language models,
we propose LoRAT, a method that unveils the power of larger Vision Transformers (ViT) for …

OmniVid: A generative framework for universal video understanding

J Wang, D Chen, C Luo, B He, L Yuan… - Proceedings of the …, 2024 - openaccess.thecvf.com
The core of video understanding tasks, such as recognition, captioning, and tracking, is to
automatically detect objects or actions in a video and analyze their temporal evolution …

Towards real-world visual tracking with temporal contexts

Z Cao, Z Huang, L Pan, S Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Visual tracking has made significant improvements in the past few decades. Most existing
state-of-the-art trackers 1) merely aim for performance in ideal conditions while overlooking …

DiffusionTrack: Point Set Diffusion Model for Visual Object Tracking

F Xie, Z Wang, C Ma - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Existing Siamese or transformer trackers commonly pose visual object tracking as a
one-shot detection problem, i.e., locating the target object in a single forward evaluation scheme …

Learning symmetry-aware geometry correspondences for 6D object pose estimation

H Zhao, S Wei, D Shi, W Tan, Z Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Current 6D pose estimation methods focus on handling objects that are previously trained,
which limits their applications in the real dynamic world. To this end, we propose a geometry …

ARTrackV2: Prompting autoregressive tracker where to look and how to describe

Y Bai, Z Zhao, Y Gong, X Wei - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We present ARTrackV2, which integrates two pivotal aspects of tracking: determining where
to look (localization) and how to describe (appearance analysis) the target object across …

SDSTrack: Self-distillation symmetric adapter learning for multi-modal visual object tracking

X Hou, J Xing, Y Qian, Y Guo, S Xin… - Proceedings of the …, 2024 - openaccess.thecvf.com
Multimodal Visual Object Tracking (VOT) has recently gained significant attention
due to its robustness. Early research focused on fully fine-tuning RGB-based trackers which …

ODTrack: Online dense temporal token learning for visual tracking

Y Zheng, B Zhong, Q Liang, Z Mo, S Zhang… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Online contextual reasoning and association across consecutive video frames are critical to
perceive instances in visual tracking. However, most current top-performing trackers …