Satellite video single object tracking: A systematic review and an oriented object tracking benchmark
Single object tracking (SOT) in satellite video (SV) enables the continuous acquisition of
position and range information of an arbitrary object, showing promising value in remote …
position and range information of an arbitrary object, showing promising value in remote …
Sed: A simple encoder-decoder for open-vocabulary semantic segmentation
Open-vocabulary semantic segmentation strives to distinguish pixels into different semantic
groups from an open set of categories. Most existing methods explore utilizing pre-trained …
groups from an open set of categories. Most existing methods explore utilizing pre-trained …
Tracking meets lora: Faster training, larger model, stronger performance
Abstract Motivated by the Parameter-Efficient Fine-Tuning (PEFT) in large language models,
we propose LoRAT, a method that unveils the power of larger Vision Transformers (ViT) for …
we propose LoRAT, a method that unveils the power of larger Vision Transformers (ViT) for …
Omnivid: A generative framework for universal video understanding
The core of video understanding tasks such as recognition captioning and tracking is to
automatically detect objects or actions in a video and analyze their temporal evolution …
automatically detect objects or actions in a video and analyze their temporal evolution …
Towards real-world visual tracking with temporal contexts
Visual tracking has made significant improvements in the past few decades. Most existing
state-of-the-art trackers 1) merely aim for performance in ideal conditions while overlooking …
state-of-the-art trackers 1) merely aim for performance in ideal conditions while overlooking …
DiffusionTrack: Point Set Diffusion Model for Visual Object Tracking
Existing Siamese or transformer trackers commonly pose visual object tracking as a one-
shot detection problem ie locating the target object in a single forward evaluation scheme …
shot detection problem ie locating the target object in a single forward evaluation scheme …
Learning symmetry-aware geometry correspondences for 6d object pose estimation
Current 6D pose estimation methods focus on handling objects that are previously trained,
which limits their applications in real dynamic world. To this end, we propose a geometry …
which limits their applications in real dynamic world. To this end, we propose a geometry …
Artrackv2: Prompting autoregressive tracker where to look and how to describe
We present ARTrackV2 which integrates two pivotal aspects of tracking: determining where
to look (localization) and how to describe (appearance analysis) the target object across …
to look (localization) and how to describe (appearance analysis) the target object across …
Sdstrack: Self-distillation symmetric adapter learning for multi-modal visual object tracking
Abstract Multimodal Visual Object Tracking (VOT) has recently gained significant attention
due to its robustness. Early research focused on fully fine-tuning RGB-based trackers which …
due to its robustness. Early research focused on fully fine-tuning RGB-based trackers which …
Odtrack: Online dense temporal token learning for visual tracking
Online contextual reasoning and association across consecutive video frames are critical to
perceive instances in visual tracking. However, most current top-performing trackers …
perceive instances in visual tracking. However, most current top-performing trackers …