Flowformer: A transformer architecture for optical flow

Z Huang, X Shi, C Zhang, Q Wang, KC Cheung… - European conference on …, 2022 - Springer
We introduce optical Flow transFormer, dubbed as FlowFormer, a transformer-based neural
network architecture for learning optical flow. FlowFormer tokenizes the 4D cost volume built …

Flowformer++: Masked cost volume autoencoding for pretraining optical flow estimation

X Shi, Z Huang, D Li, M Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
FlowFormer introduces a transformer architecture into optical flow estimation and achieves
state-of-the-art performance. The core component of FlowFormer is the transformer-based …

Propainter: Improving propagation and transformer for video inpainting

S Zhou, C Li, KCK Chan… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Flow-based propagation and spatiotemporal Transformer are two mainstream mechanisms
in video inpainting (VI). Despite the effectiveness of these components, they still suffer from …

Space-time neural irradiance fields for free-viewpoint video

W **an, JB Huang, J Kopf… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We present a method that learns a spatiotemporal neural irradiance field for dynamic scenes
from a single video. Our learned representation enables free-viewpoint rendering of the …

Towards an end-to-end framework for flow-guided video inpainting

Z Li, CZ Lu, J Qin, CL Guo… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Optical flow, which captures motion information across frames, is exploited in recent video
inpainting methods through propagating pixels along its trajectories. However, the hand …

Learning joint spatial-temporal transformations for video inpainting

Y Zeng, J Fu, H Chao - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
High-quality video inpainting that completes missing regions in video frames is a promising
yet challenging task. State-of-the-art approaches adopt attention models to complete a frame …

Removing objects from neural radiance fields

S Weder, G Garcia-Hernando… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Neural Radiance Fields (NeRFs) are emerging as a ubiquitous scene
representation that allows for novel view synthesis. Increasingly, NeRFs will be shareable …

High-fidelity 3d gan inversion by pseudo-multi-view optimization

J **e, H Ouyang, J Piao, C Lei… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a high-fidelity 3D generative adversarial network (GAN) inversion framework
that can synthesize photo-realistic novel views while preserving specific details of the input …

Fuseformer: Fusing fine-grained information in transformers for video inpainting

R Liu, H Deng, Y Huang, X Shi, L Lu… - Proceedings of the …, 2021 - openaccess.thecvf.com
Transformer, as a strong and flexible architecture for modelling long-range relations, has
been widely explored in vision tasks. However, when used in video inpainting that requires …

Carafe: Content-aware reassembly of features

J Wang, K Chen, R Xu, Z Liu… - Proceedings of the …, 2019 - openaccess.thecvf.com
Feature upsampling is a key operation in a number of modern convolutional network
architectures, eg feature pyramids. Its design is critical for dense prediction tasks such as …