SeqTrack: Sequence to sequence learning for visual object tracking
In this paper, we present a new sequence-to-sequence learning framework for visual
tracking, dubbed SeqTrack. It casts visual tracking as a sequence generation problem …
Visual prompt multi-modal tracking
Visible-modal object tracking gives rise to a series of downstream multi-modal tracking
tributaries. To inherit the powerful representations of the foundation model, a natural modus …
Multimodal prompting with missing modalities for visual recognition
In this paper, we tackle two challenges in multimodal learning for visual recognition: 1) when
missing-modality occurs either during training or testing in real-world situations; and 2) when …
Dual modality prompt tuning for vision-language pre-trained model
With the emergence of large pretrained vision-language models such as CLIP, transferable
representations can be adapted to a wide range of downstream tasks via prompt tuning …
Single-model and any-modality for video object tracking
In the realm of video object tracking, auxiliary modalities such as depth, thermal, or event data
have emerged as valuable assets to complement the RGB trackers. In practice, most existing …
OneTracker: Unifying visual object tracking with foundation models and efficient tuning
Visual object tracking aims to localize the target object of each frame based on its initial
appearance in the first frame. Depending on the input modality, tracking tasks can be divided …
Bi-directional adapter for multimodal tracking
Due to the rapid development of computer vision, single-modal (RGB) object tracking has
made significant progress in recent years. Considering the limitation of single imaging …
SDSTrack: Self-distillation symmetric adapter learning for multi-modal visual object tracking
Multimodal Visual Object Tracking (VOT) has recently gained significant attention
due to its robustness. Early research focused on fully fine-tuning RGB-based trackers which …
Efficient multimodal semantic segmentation via dual-prompt learning
Multimodal (e.g., RGB-Depth/RGB-Thermal) fusion has shown great potential for improving
semantic segmentation in complex scenes (e.g., indoor/low-light conditions). Existing …
RGBT tracking: A comprehensive review
M Feng, J Su - Information Fusion, 2024 - Elsevier
In recent years, visual object tracking, as a prominent research area in computer vision, has
garnered significant attention. To bolster the robustness of trackers across a spectrum of …