- Academic Search

X Chen, H Peng, D Wang, H Lu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

In this paper, we present a new sequence-to-sequence learning framework for visual
tracking, dubbed SeqTrack. It casts visual tracking as a sequence generation problem …

Speichern Zitieren Zitiert von: 229 Ähnliche Artikel Alle 5 Versionen HTML-Version

[Free GPT-4]

[PDF] thecvf.com

Visual prompt multi-modal tracking

J Zhu, S Lai, X Chen, D Wang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Visible-modal object tracking gives rise to a series of downstream multi-modal tracking
tributaries. To inherit the powerful representations of the foundation model, a natural modus …

Speichern Zitieren Zitiert von: 205 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]

[PDF] thecvf.com

Multimodal prompting with missing modalities for visual recognition

YL Lee, YH Tsai, WC Chiu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

In this paper, we tackle two challenges in multimodal learning for visual recognition: 1) when
missing-modality occurs either during training or testing in real-world situations; and 2) when …

Speichern Zitieren Zitiert von: 115 Ähnliche Artikel Alle 8 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Dual modality prompt tuning for vision-language pre-trained model

Y **ng, Q Wu, D Cheng, S Zhang… - IEEE Transactions …, 2023 - ieeexplore.ieee.org

With the emergence of large pretrained vison-language models such as CLIP, transferable
representations can be adapted to a wide range of downstream tasks via prompt tuning …

Speichern Zitieren Zitiert von: 103 Ähnliche Artikel Alle 4 Versionen

[Free GPT-4]

[PDF] thecvf.com

Single-model and any-modality for video object tracking

Z Wu, J Zheng, X Ren, FA Vasluianu… - Proceedings of the …, 2024 - openaccess.thecvf.com

In the realm of video object tracking auxiliary modalities such as depth thermal or event data
have emerged as valuable assets to complement the RGB trackers. In practice most existing …

Speichern Zitieren Zitiert von: 36 Ähnliche Artikel Alle 3 Versionen HTML-Version

[Free GPT-4]

[PDF] thecvf.com

Onetracker: Unifying visual object tracking with foundation models and efficient tuning

L Hong, S Yan, R Zhang, W Li, X Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com

Visual object tracking aims to localize the target object of each frame based on its initial
appearance in the first frame. Depending on the input modility tracking tasks can be divided …

Speichern Zitieren Zitiert von: 37 Ähnliche Artikel Alle 3 Versionen HTML-Version

[Free GPT-4]

[PDF] aaai.org

Bi-directional adapter for multimodal tracking

B Cao, J Guo, P Zhu, Q Hu - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org

Due to the rapid development of computer vision, single-modal (RGB) object tracking has
made significant progress in recent years. Considering the limitation of single imaging …

Speichern Zitieren Zitiert von: 44 Ähnliche Artikel Alle 3 Versionen HTML-Version

[Free GPT-4]

[PDF] thecvf.com

Sdstrack: Self-distillation symmetric adapter learning for multi-modal visual object tracking

X Hou, J **ng, Y Qian, Y Guo, S **n… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract Multimodal Visual Object Tracking (VOT) has recently gained significant attention
due to its robustness. Early research focused on fully fine-tuning RGB-based trackers which …

Speichern Zitieren Zitiert von: 30 Ähnliche Artikel Alle 4 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Efficient multimodal semantic segmentation via dual-prompt learning

S Dong, Y Feng, Q Yang, Y Huang… - 2024 IEEE/RSJ …, 2024 - ieeexplore.ieee.org

Multimodal (eg, RGB-Depth/RGB-Thermal) fusion has shown great potential for improving
semantic segmentation in complex scenes (eg, indoor/low-light conditions). Existing …

Speichern Zitieren Zitiert von: 15 Ähnliche Artikel Alle 2 Versionen

RGBT tracking: A comprehensive review

M Feng, J Su - Information Fusion, 2024 - Elsevier

In recent years, visual object tracking, as a prominent research area in computer vision, has
garnered significant attention. To bolster the robustness of trackers across a spectrum of …

Speichern Zitieren Zitiert von: 5 Ähnliche Artikel

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Prompting for multi-modal tracking

Seqtrack: Sequence to sequence learning for visual object tracking

Visual prompt multi-modal tracking

Multimodal prompting with missing modalities for visual recognition

Dual modality prompt tuning for vision-language pre-trained model

Single-model and any-modality for video object tracking

Onetracker: Unifying visual object tracking with foundation models and efficient tuning

Bi-directional adapter for multimodal tracking

Sdstrack: Self-distillation symmetric adapter learning for multi-modal visual object tracking

Efficient multimodal semantic segmentation via dual-prompt learning

RGBT tracking: A comprehensive review