Cotracker: It is better to track together

N Karaev, I Rocco, B Graham, N Neverova… - … on Computer Vision, 2024 - Springer
We introduce CoTracker, a transformer-based model that tracks a large number of 2D points
in long video sequences. Differently from most existing approaches that track points …

Flowformer: A transformer architecture for optical flow

Z Huang, X Shi, C Zhang, Q Wang, KC Cheung… - European conference on …, 2022 - Springer
We introduce optical Flow transFormer, dubbed as FlowFormer, a transformer-based neural
network architecture for learning optical flow. FlowFormer tokenizes the 4D cost volume built …

Gmflow: Learning optical flow via global matching

H Xu, J Zhang, J Cai… - Proceedings of the …, 2022 - openaccess.thecvf.com
Learning-based optical flow estimation has been dominated with the pipeline of cost volume
with convolutions for flow regression, which is inherently limited to local correlations and …

Unifying flow, stereo and depth estimation

H Xu, J Zhang, J Cai, H Rezatofighi… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
We present a unified formulation and model for three motion and 3D perception tasks:
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …

Flowformer++: Masked cost volume autoencoding for pretraining optical flow estimation

X Shi, Z Huang, D Li, M Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
FlowFormer introduces a transformer architecture into optical flow estimation and achieves
state-of-the-art performance. The core component of FlowFormer is the transformer-based …

Transflow: Transformer as flow learner

Y Lu, Q Wang, S Ma, T Geng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Optical flow is an indispensable building block for various important computer vision tasks,
including motion estimation, object tracking, and disparity measurement. In this work, we …

Videoflow: Exploiting temporal cues for multi-frame optical flow estimation

X Shi, Z Huang, W Bian, D Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce VideoFlow, a novel optical flow estimation framework for videos. In contrast to
previous methods that learn to estimate optical flow from two frames, VideoFlow concurrently …

Amt: All-pairs multi-field transforms for efficient frame interpolation

Z Li, ZL Zhu, LH Han, Q Hou… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract We present All-Pairs Multi-Field Transforms (AMT), a new network architecture for
video frame interpolation. It is based on two essential designs. First, we build bidirectional …

Taptr: Tracking any point with transformers as detection

H Li, H Zhang, S Liu, Z Zeng, T Ren, F Li… - European Conference on …, 2024 - Springer
In this paper, we propose a simple yet effective approach for Tracking Any Point with
TRansformers (TAPTR). Based on the observation that point tracking bears a great …

Learning optical flow with kernel patch attention

A Luo, F Yang, X Li, S Liu - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Optical flow is a fundamental method used for quantitative motion estimation on the image
plane. In the deep learning era, most works treat it as a task of'matching of features', learning …