Seqtrack: Sequence to sequence learning for visual object tracking

X Chen, H Peng, D Wang, H Lu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In this paper, we present a new sequence-to-sequence learning framework for visual
tracking, dubbed SeqTrack. It casts visual tracking as a sequence generation problem …

Autoregressive visual tracking

X Wei, Y Bai, Y Zheng, D Shi… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We present ARTrack, an autoregressive framework for visual object tracking. ARTrack
tackles tracking as a coordinate sequence interpretation task that estimates object …

Generalized relation modeling for transformer tracking

S Gao, C Zhou, J Zhang - … of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com
Compared with previous two-stream trackers, the recent one-stream tracking pipeline, which
allows earlier interaction between the template and search region, has achieved a …

Mixformer: End-to-end tracking with iterative mixed attention

Y Cui, C Jiang, L Wang, G Wu - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Tracking often uses a multi-stage pipeline of feature extraction, target information
integration, and bounding box estimation. To simplify this pipeline and unify the process of …

Joint feature learning and relation modeling for tracking: A one-stream framework

B Ye, H Chang, B Ma, S Shan, X Chen - European conference on …, 2022 - Springer
The current popular two-stream, two-stage tracking framework extracts the template and the
search region features separately and then performs relation modeling, thus the extracted …

Dropmae: Masked autoencoders with spatial-attention dropout for tracking tasks

Q Wu, T Yang, Z Liu, B Wu, Y Shan… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we study masked autoencoder (MAE) pretraining on videos for matching-
based downstream tasks, including visual object tracking (VOT) and video object …

Transforming model prediction for tracking

C Mayer, M Danelljan, G Bhat, M Paul… - Proceedings of the …, 2022 - openaccess.thecvf.com
Optimization based tracking methods have been widely successful by integrating a target
model prediction module, providing effective global reasoning by minimizing an objective …

Swintrack: A simple and strong baseline for transformer tracking

L Lin, H Fan, Z Zhang, Y Xu… - Advances in Neural …, 2022 - proceedings.neurips.cc
Recently Transformer has been largely explored in tracking and shown state-of-the-art
(SOTA) performance. However, existing efforts mainly focus on fusing and enhancing …

Backbone is all your need: A simplified architecture for visual object tracking

B Chen, P Li, L Bai, L Qiao, Q Shen, B Li, W Gan… - European conference on …, 2022 - Springer
Exploiting a general-purpose neural architecture to replace hand-wired designs or inductive
biases has recently drawn extensive interest. However, existing tracking approaches rely on …

Transformer tracking with cyclic shifting window attention

Z Song, J Yu, YPP Chen… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Transformer architecture has been showing its great strength in visual object tracking, for its
effective attention mechanism. Existing transformer-based approaches adopt the pixel-to …