Seqtrack: Sequence to sequence learning for visual object tracking

X Chen, H Peng, D Wang, H Lu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In this paper, we present a new sequence-to-sequence learning framework for visual
tracking, dubbed SeqTrack. It casts visual tracking as a sequence generation problem …

Mixformer: End-to-end tracking with iterative mixed attention

Y Cui, C Jiang, L Wang, G Wu - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Tracking often uses a multi-stage pipeline of feature extraction, target information
integration, and bounding box estimation. To simplify this pipeline and unify the process of …

Generalized relation modeling for transformer tracking

S Gao, C Zhou, J Zhang - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Compared with previous two-stream trackers, the recent one-stream tracking pipeline, which
allows earlier interaction between the template and search region, has achieved a …

Mixformerv2: Efficient fully transformer tracking

Y Cui, T Song, G Wu, L Wang - Advances in neural …, 2023 - proceedings.neurips.cc
Transformer-based trackers have achieved strong accuracy on the standard benchmarks.
However, their efficiency remains an obstacle to practical deployment on both GPU and …

Transformer meets remote sensing video detection and tracking: A comprehensive survey

L Jiao, X Zhang, X Liu, F Liu, S Yang… - IEEE Journal of …, 2023 - ieeexplore.ieee.org
Transformer has shown excellent performance in remote sensing field with long-range
modeling capabilities. Remote sensing video (RSV) moving object detection and tracking …

Exploring lightweight hierarchical vision transformers for efficient visual tracking

B Kang, X Chen, D Wang, H Peng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Transformer-based visual trackers have demonstrated significant progress owing to their
superior modeling capabilities. However, existing trackers are hampered by low speed …

A survey of the vision transformers and their CNN-transformer based variants

A Khan, Z Rauf, A Sohail, AR Khan, H Asif… - Artificial Intelligence …, 2023 - Springer
Vision transformers have become popular as a possible substitute to convolutional neural
networks (CNNs) for a variety of computer vision applications. These transformers, with their …

Joint visual grounding and tracking with natural language specification

L Zhou, Z Zhou, K Mao, Z He - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Tracking by natural language specification aims to locate the referred target in a sequence
based on the natural language description. Existing algorithms solve this issue in two steps …

Review and analysis of rgbt single object tracking methods: A fusion perspective

ZH Zhang, J Wang, S Li, L **, H Wu, J Zhao… - ACM Transactions on …, 2024 - dl.acm.org
Visual tracking is a fundamental task in computer vision with significant practical
applications in various domains, including surveillance, security, robotics, and human …

Compact transformer tracker with correlative masked modeling

Z Song, R Luo, J Yu, YPP Chen, W Yang - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Transformer framework has been showing superior performances in visual object tracking
for its great strength in information aggregation across the template and search image with …