CoTracker3: Simpler and better point tracking by pseudo-labelling real videos

N Karaev, I Makarov, J Wang, N Neverova… - arxiv preprint arxiv …, 2024 - arxiv.org
Most state-of-the-art point trackers are trained on synthetic data due to the difficulty of
annotating real videos for this task. However, this can result in suboptimal performance due …

Taptrv2: Attention-based position update improves tracking any point

H Li, H Zhang, S Liu, Z Zeng, F Li, T Ren, B Li… - arxiv preprint arxiv …, 2024 - arxiv.org
In this paper, we present TAPTRv2, a Transformer-based approach built upon TAPTR for
solving the Tracking Any Point (TAP) task. TAPTR borrows designs from DEtection …

X-pose: Detecting any keypoints

J Yang, A Zeng, R Zhang, L Zhang - European Conference on Computer …, 2024 - Springer
This work aims to address an advanced keypoint detection problem: how to accurately
detect any keypoints in complex real-world scenarios, which involves massive, messy, and …

Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation

H Jeong, CHP Huang, JC Ye, N Mitra… - arxiv preprint arxiv …, 2024 - arxiv.org
While recent foundational video generators produce visually rich output, they still struggle
with appearance drift, where objects gradually degrade or change inconsistently across …

ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking

T Zhang, C Wang, Z Dou, Q Gao, J Lei, B Chen… - arxiv preprint arxiv …, 2025 - arxiv.org
In this paper, we propose ProTracker, a novel framework for robust and accurate long-term
dense tracking of arbitrary points in videos. The key idea of our method is incorporating …

DELTA: Dense Efficient Long-range 3D Tracking for any video

TD Ngo, P Zhuang, C Gan, E Kalogerakis… - arxiv preprint arxiv …, 2024 - arxiv.org
Tracking dense 3D motion from monocular videos remains challenging, particularly when
aiming for pixel-level precision over long sequences. We introduce DELTA, a novel method …

Track-On: Transformer-based Online Point Tracking with Memory

G Aydemir, X Cai, W **e, F Güney - arxiv preprint arxiv:2501.18487, 2025 - arxiv.org
In this paper, we consider the problem of long-term point tracking, which requires consistent
identification of points across multiple frames in a video, despite changes in appearance …

TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video

J Qu, H Li, S Liu, T Ren, Z Zeng, L Zhang - arxiv preprint arxiv:2411.18671, 2024 - arxiv.org
In this paper, we present TAPTRv3, which is built upon TAPTRv2 to improve its point
tracking robustness in long videos. TAPTRv2 is a simple DETR-like framework that can …

MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation

J Serych, M Neoral, J Matas - arxiv preprint arxiv:2411.09551, 2024 - arxiv.org
In this work, we present MFTIQ, a novel dense long-term tracking model that advances the
Multi-Flow Tracker (MFT) framework to address challenges in point-level visual tracking in …

Exploring Temporally-Aware Features for Point Tracking

IH Kim, S Cho, J Huang, J Yi, JY Lee, S Kim - arxiv preprint arxiv …, 2025 - arxiv.org
Point tracking in videos is a fundamental task with applications in robotics, video editing, and
more. While many vision tasks benefit from pre-trained feature backbones to improve …