Flowformer: A transformer architecture for optical flow
We introduce optical Flow transFormer, dubbed as FlowFormer, a transformer-based neural
network architecture for learning optical flow. FlowFormer tokenizes the 4D cost volume built …
network architecture for learning optical flow. FlowFormer tokenizes the 4D cost volume built …
Flowformer++: Masked cost volume autoencoding for pretraining optical flow estimation
FlowFormer introduces a transformer architecture into optical flow estimation and achieves
state-of-the-art performance. The core component of FlowFormer is the transformer-based …
state-of-the-art performance. The core component of FlowFormer is the transformer-based …
Propainter: Improving propagation and transformer for video inpainting
Flow-based propagation and spatiotemporal Transformer are two mainstream mechanisms
in video inpainting (VI). Despite the effectiveness of these components, they still suffer from …
in video inpainting (VI). Despite the effectiveness of these components, they still suffer from …
Space-time neural irradiance fields for free-viewpoint video
We present a method that learns a spatiotemporal neural irradiance field for dynamic scenes
from a single video. Our learned representation enables free-viewpoint rendering of the …
from a single video. Our learned representation enables free-viewpoint rendering of the …
Towards an end-to-end framework for flow-guided video inpainting
Optical flow, which captures motion information across frames, is exploited in recent video
inpainting methods through propagating pixels along its trajectories. However, the hand …
inpainting methods through propagating pixels along its trajectories. However, the hand …
Learning joint spatial-temporal transformations for video inpainting
High-quality video inpainting that completes missing regions in video frames is a promising
yet challenging task. State-of-the-art approaches adopt attention models to complete a frame …
yet challenging task. State-of-the-art approaches adopt attention models to complete a frame …
Removing objects from neural radiance fields
Abstract Neural Radiance Fields (NeRFs) are emerging as a ubiquitous scene
representation that allows for novel view synthesis. Increasingly, NeRFs will be shareable …
representation that allows for novel view synthesis. Increasingly, NeRFs will be shareable …
High-fidelity 3d gan inversion by pseudo-multi-view optimization
We present a high-fidelity 3D generative adversarial network (GAN) inversion framework
that can synthesize photo-realistic novel views while preserving specific details of the input …
that can synthesize photo-realistic novel views while preserving specific details of the input …
Fuseformer: Fusing fine-grained information in transformers for video inpainting
Transformer, as a strong and flexible architecture for modelling long-range relations, has
been widely explored in vision tasks. However, when used in video inpainting that requires …
been widely explored in vision tasks. However, when used in video inpainting that requires …
Carafe: Content-aware reassembly of features
Feature upsampling is a key operation in a number of modern convolutional network
architectures, eg feature pyramids. Its design is critical for dense prediction tasks such as …
architectures, eg feature pyramids. Its design is critical for dense prediction tasks such as …