Seqtrack: Sequence to sequence learning for visual object tracking
In this paper, we present a new sequence-to-sequence learning framework for visual
tracking, dubbed SeqTrack. It casts visual tracking as a sequence generation problem …
tracking, dubbed SeqTrack. It casts visual tracking as a sequence generation problem …
Mixformer: End-to-end tracking with iterative mixed attention
Tracking often uses a multi-stage pipeline of feature extraction, target information
integration, and bounding box estimation. To simplify this pipeline and unify the process of …
integration, and bounding box estimation. To simplify this pipeline and unify the process of …
Generalized relation modeling for transformer tracking
Compared with previous two-stream trackers, the recent one-stream tracking pipeline, which
allows earlier interaction between the template and search region, has achieved a …
allows earlier interaction between the template and search region, has achieved a …
Mixformerv2: Efficient fully transformer tracking
Transformer-based trackers have achieved strong accuracy on the standard benchmarks.
However, their efficiency remains an obstacle to practical deployment on both GPU and …
However, their efficiency remains an obstacle to practical deployment on both GPU and …
Transformer meets remote sensing video detection and tracking: A comprehensive survey
Transformer has shown excellent performance in remote sensing field with long-range
modeling capabilities. Remote sensing video (RSV) moving object detection and tracking …
modeling capabilities. Remote sensing video (RSV) moving object detection and tracking …
Exploring lightweight hierarchical vision transformers for efficient visual tracking
Transformer-based visual trackers have demonstrated significant progress owing to their
superior modeling capabilities. However, existing trackers are hampered by low speed …
superior modeling capabilities. However, existing trackers are hampered by low speed …
A survey of the vision transformers and their CNN-transformer based variants
Vision transformers have become popular as a possible substitute to convolutional neural
networks (CNNs) for a variety of computer vision applications. These transformers, with their …
networks (CNNs) for a variety of computer vision applications. These transformers, with their …
Joint visual grounding and tracking with natural language specification
Tracking by natural language specification aims to locate the referred target in a sequence
based on the natural language description. Existing algorithms solve this issue in two steps …
based on the natural language description. Existing algorithms solve this issue in two steps …
Review and analysis of rgbt single object tracking methods: A fusion perspective
Visual tracking is a fundamental task in computer vision with significant practical
applications in various domains, including surveillance, security, robotics, and human …
applications in various domains, including surveillance, security, robotics, and human …
Compact transformer tracker with correlative masked modeling
Transformer framework has been showing superior performances in visual object tracking
for its great strength in information aggregation across the template and search image with …
for its great strength in information aggregation across the template and search image with …