[HTML][HTML] Data augmentation: A comprehensive survey of modern approaches

A Mumuni, F Mumuni - Array, 2022 - Elsevier
To ensure good performance, modern machine learning models typically require large
amounts of quality annotated data. Meanwhile, the data collection and annotation processes …

[HTML][HTML] Estimating optical flow: A comprehensive review of the state of the art

A Alfarano, L Maiano, L Papa, I Amerini - Computer Vision and Image …, 2024 - Elsevier
Optical flow estimation is a crucial task in computer vision that provides low-level motion
information. Despite recent advances, real-world applications still present significant …

Cotracker: It is better to track together

N Karaev, I Rocco, B Graham, N Neverova… - … on Computer Vision, 2024 - Springer
We introduce CoTracker, a transformer-based model that tracks a large number of 2D points
in long video sequences. Differently from most existing approaches that track points …

Synthetic data from diffusion models improves imagenet classification

S Azizi, S Kornblith, C Saharia, M Norouzi… - arxiv preprint arxiv …, 2023 - arxiv.org
Deep generative models are becoming increasingly powerful, now generating diverse high
fidelity photo-realistic samples given text prompts. Have they reached the point where …

Pointodyssey: A large-scale synthetic dataset for long-term point tracking

Y Zheng, AW Harley, B Shen… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce PointOdyssey, a large-scale synthetic dataset, and data generation framework,
for the training and evaluation of long-term fine-grained tracking algorithms. Our goal is to …

Flowformer: A transformer architecture for optical flow

Z Huang, X Shi, C Zhang, Q Wang, KC Cheung… - European conference on …, 2022 - Springer
We introduce optical Flow transFormer, dubbed as FlowFormer, a transformer-based neural
network architecture for learning optical flow. FlowFormer tokenizes the 4D cost volume built …

Gmflow: Learning optical flow via global matching

H Xu, J Zhang, J Cai… - Proceedings of the …, 2022 - openaccess.thecvf.com
Learning-based optical flow estimation has been dominated with the pipeline of cost volume
with convolutions for flow regression, which is inherently limited to local correlations and …

Practical stereo matching via cascaded recurrent network with adaptive correlation

J Li, P Wang, P **ong, T Cai, Z Yan… - Proceedings of the …, 2022 - openaccess.thecvf.com
With the advent of convolutional neural networks, stereo matching algorithms have recently
gained tremendous progress. However, it remains a great challenge to accurately extract …

Unifying flow, stereo and depth estimation

H Xu, J Zhang, J Cai, H Rezatofighi… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
We present a unified formulation and model for three motion and 3D perception tasks:
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …

Perceiver io: A general architecture for structured inputs & outputs

A Jaegle, S Borgeaud, JB Alayrac, C Doersch… - arxiv preprint arxiv …, 2021 - arxiv.org
A central goal of machine learning is the development of systems that can solve many
problems in as many data domains as possible. Current architectures, however, cannot be …