SeqTrack: Sequence to sequence learning for visual object tracking

X Chen, H Peng, D Wang, H Lu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In this paper, we present a new sequence-to-sequence learning framework for visual
tracking, dubbed SeqTrack. It casts visual tracking as a sequence generation problem …

Visual prompt multi-modal tracking

J Zhu, S Lai, X Chen, D Wang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Visible-modal object tracking gives rise to a series of downstream multi-modal tracking
tributaries. To inherit the powerful representations of the foundation model, a natural modus …

Multimodal prompting with missing modalities for visual recognition

YL Lee, YH Tsai, WC Chiu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In this paper, we tackle two challenges in multimodal learning for visual recognition: 1) when
missing-modality occurs either during training or testing in real-world situations; and 2) when …

Dual modality prompt tuning for vision-language pre-trained model

Y Xing, Q Wu, D Cheng, S Zhang… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
With the emergence of large pretrained vision-language models such as CLIP, transferable
representations can be adapted to a wide range of downstream tasks via prompt tuning …

Single-model and any-modality for video object tracking

Z Wu, J Zheng, X Ren, FA Vasluianu… - Proceedings of the …, 2024 - openaccess.thecvf.com
In the realm of video object tracking, auxiliary modalities such as depth, thermal, or event data
have emerged as valuable assets to complement the RGB trackers. In practice, most existing …

OneTracker: Unifying visual object tracking with foundation models and efficient tuning

L Hong, S Yan, R Zhang, W Li, X Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com
Visual object tracking aims to localize the target object of each frame based on its initial
appearance in the first frame. Depending on the input modality, tracking tasks can be divided …

Bi-directional adapter for multimodal tracking

B Cao, J Guo, P Zhu, Q Hu - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Due to the rapid development of computer vision, single-modal (RGB) object tracking has
made significant progress in recent years. Considering the limitation of single imaging …

SDSTrack: Self-distillation symmetric adapter learning for multi-modal visual object tracking

X Hou, J Xing, Y Qian, Y Guo, S Xin… - Proceedings of the …, 2024 - openaccess.thecvf.com
Multimodal Visual Object Tracking (VOT) has recently gained significant attention
due to its robustness. Early research focused on fully fine-tuning RGB-based trackers which …

Efficient multimodal semantic segmentation via dual-prompt learning

S Dong, Y Feng, Q Yang, Y Huang… - 2024 IEEE/RSJ …, 2024 - ieeexplore.ieee.org
Multimodal (e.g., RGB-Depth/RGB-Thermal) fusion has shown great potential for improving
semantic segmentation in complex scenes (e.g., indoor/low-light conditions). Existing …

RGBT tracking: A comprehensive review

M Feng, J Su - Information Fusion, 2024 - Elsevier
In recent years, visual object tracking, as a prominent research area in computer vision, has
garnered significant attention. To bolster the robustness of trackers across a spectrum of …