[HTML][HTML] Data augmentation: A comprehensive survey of modern approaches

A Mumuni, F Mumuni - Array, 2022 - Elsevier
To ensure good performance, modern machine learning models typically require large
amounts of quality annotated data. Meanwhile, the data collection and annotation processes …

Cotracker: It is better to track together

N Karaev, I Rocco, B Graham, N Neverova… - … on Computer Vision, 2024 - Springer
We introduce CoTracker, a transformer-based model that tracks a large number of 2D points
in long video sequences. Differently from most existing approaches that track points …

Synthetic data from diffusion models improves imagenet classification

S Azizi, S Kornblith, C Saharia, M Norouzi… - arxiv preprint arxiv …, 2023 - arxiv.org
Deep generative models are becoming increasingly powerful, now generating diverse high
fidelity photo-realistic samples given text prompts. Have they reached the point where …

Perceiver io: A general architecture for structured inputs & outputs

A Jaegle, S Borgeaud, JB Alayrac, C Doersch… - arxiv preprint arxiv …, 2021 - arxiv.org
A central goal of machine learning is the development of systems that can solve many
problems in as many data domains as possible. Current architectures, however, cannot be …

Gmflow: Learning optical flow via global matching

H Xu, J Zhang, J Cai… - Proceedings of the …, 2022 - openaccess.thecvf.com
Learning-based optical flow estimation has been dominated with the pipeline of cost volume
with convolutions for flow regression, which is inherently limited to local correlations and …

Flowformer: A transformer architecture for optical flow

Z Huang, X Shi, C Zhang, Q Wang, KC Cheung… - European conference on …, 2022 - Springer
We introduce optical Flow transFormer, dubbed as FlowFormer, a transformer-based neural
network architecture for learning optical flow. FlowFormer tokenizes the 4D cost volume built …

Practical stereo matching via cascaded recurrent network with adaptive correlation

J Li, P Wang, P **ong, T Cai, Z Yan… - Proceedings of the …, 2022 - openaccess.thecvf.com
With the advent of convolutional neural networks, stereo matching algorithms have recently
gained tremendous progress. However, it remains a great challenge to accurately extract …

Pointodyssey: A large-scale synthetic dataset for long-term point tracking

Y Zheng, AW Harley, B Shen… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce PointOdyssey, a large-scale synthetic dataset, and data generation framework,
for the training and evaluation of long-term fine-grained tracking algorithms. Our goal is to …

Residual local feature network for efficient super-resolution

F Kong, M Li, S Liu, D Liu, J He… - Proceedings of the …, 2022 - openaccess.thecvf.com
Deep learning based approaches has achieved great performance in single image super-
resolution (SISR). However, recent advances in efficient super-resolution focus on reducing …

Unifying flow, stereo and depth estimation

H Xu, J Zhang, J Cai, H Rezatofighi… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
We present a unified formulation and model for three motion and 3D perception tasks:
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …