Cotracker: It is better to track together
We introduce CoTracker, a transformer-based model that tracks a large number of 2D points
in long video sequences. Differently from most existing approaches that track points …
in long video sequences. Differently from most existing approaches that track points …
Flowformer: A transformer architecture for optical flow
We introduce optical Flow transFormer, dubbed as FlowFormer, a transformer-based neural
network architecture for learning optical flow. FlowFormer tokenizes the 4D cost volume built …
network architecture for learning optical flow. FlowFormer tokenizes the 4D cost volume built …
Gmflow: Learning optical flow via global matching
Learning-based optical flow estimation has been dominated with the pipeline of cost volume
with convolutions for flow regression, which is inherently limited to local correlations and …
with convolutions for flow regression, which is inherently limited to local correlations and …
Unifying flow, stereo and depth estimation
We present a unified formulation and model for three motion and 3D perception tasks:
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …
Flowformer++: Masked cost volume autoencoding for pretraining optical flow estimation
FlowFormer introduces a transformer architecture into optical flow estimation and achieves
state-of-the-art performance. The core component of FlowFormer is the transformer-based …
state-of-the-art performance. The core component of FlowFormer is the transformer-based …
Transflow: Transformer as flow learner
Optical flow is an indispensable building block for various important computer vision tasks,
including motion estimation, object tracking, and disparity measurement. In this work, we …
including motion estimation, object tracking, and disparity measurement. In this work, we …
Videoflow: Exploiting temporal cues for multi-frame optical flow estimation
We introduce VideoFlow, a novel optical flow estimation framework for videos. In contrast to
previous methods that learn to estimate optical flow from two frames, VideoFlow concurrently …
previous methods that learn to estimate optical flow from two frames, VideoFlow concurrently …
Amt: All-pairs multi-field transforms for efficient frame interpolation
Abstract We present All-Pairs Multi-Field Transforms (AMT), a new network architecture for
video frame interpolation. It is based on two essential designs. First, we build bidirectional …
video frame interpolation. It is based on two essential designs. First, we build bidirectional …
Taptr: Tracking any point with transformers as detection
In this paper, we propose a simple yet effective approach for Tracking Any Point with
TRansformers (TAPTR). Based on the observation that point tracking bears a great …
TRansformers (TAPTR). Based on the observation that point tracking bears a great …
Learning optical flow with kernel patch attention
Optical flow is a fundamental method used for quantitative motion estimation on the image
plane. In the deep learning era, most works treat it as a task of'matching of features', learning …
plane. In the deep learning era, most works treat it as a task of'matching of features', learning …