Cotracker: It is better to track together
We introduce CoTracker, a transformer-based model that tracks a large number of 2D points
in long video sequences. Differently from most existing approaches that track points …
in long video sequences. Differently from most existing approaches that track points …
Gmflow: Learning optical flow via global matching
Learning-based optical flow estimation has been dominated with the pipeline of cost volume
with convolutions for flow regression, which is inherently limited to local correlations and …
with convolutions for flow regression, which is inherently limited to local correlations and …
Unifying flow, stereo and depth estimation
We present a unified formulation and model for three motion and 3D perception tasks:
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …
Transflow: Transformer as flow learner
Optical flow is an indispensable building block for various important computer vision tasks,
including motion estimation, object tracking, and disparity measurement. In this work, we …
including motion estimation, object tracking, and disparity measurement. In this work, we …
Hierarchical neural architecture search for deep stereo matching
To reduce the human efforts in neural network design, Neural Architecture Search (NAS)
has been applied with remarkable success to various high-level vision tasks such as …
has been applied with remarkable success to various high-level vision tasks such as …
Craft: Cross-attentional flow transformer for robust optical flow
Optical flow estimation aims to find the 2D motion field by identifying corresponding pixels
between two images. Despite the tremendous progress of deep learning-based optical flow …
between two images. Despite the tremendous progress of deep learning-based optical flow …
Global matching with overlap** attention for optical flow estimation
Optical flow estimation is a fundamental task in computer vision. Recent direct-regression
methods using deep neural networks achieve remarkable performance improvement …
methods using deep neural networks achieve remarkable performance improvement …
Separable flow: Learning motion cost volumes for optical flow estimation
F Zhang, OJ Woodford… - Proceedings of the …, 2021 - openaccess.thecvf.com
Full-motion cost volumes play a central role in current state-of-the-art optical flow methods.
However, constructed using simple feature correlations, they lack the ability to encapsulate …
However, constructed using simple feature correlations, they lack the ability to encapsulate …
RGB-D saliency detection via cascaded mutual information minimization
Existing RGB-D saliency detection models do not explicitly encourage RGB and depth to
achieve effective multi-modal learning. In this paper, we introduce a novel multi-stage …
achieve effective multi-modal learning. In this paper, we introduce a novel multi-stage …
Taptr: Tracking any point with transformers as detection
In this paper, we propose a simple yet effective approach for Tracking Any Point with
TRansformers (TAPTR). Based on the observation that point tracking bears a great …
TRansformers (TAPTR). Based on the observation that point tracking bears a great …