Make-an-audio: Text-to-audio generation with prompt-enhanced diffusion models
Large-scale multimodal generative modeling has created milestones in text-to-image and
text-to-video generation. Its application to audio still lags behind for two main reasons: the …
text-to-video generation. Its application to audio still lags behind for two main reasons: the …
Repaint: Inpainting using denoising diffusion probabilistic models
Free-form inpainting is the task of adding new content to an image in the regions specified
by an arbitrary binary mask. Most existing approaches train for a certain distribution of …
by an arbitrary binary mask. Most existing approaches train for a certain distribution of …
Mat: Mask-aware transformer for large hole image inpainting
Recent studies have shown the importance of modeling long-range interactions in the
inpainting problem. To achieve this goal, existing approaches exploit either standalone …
inpainting problem. To achieve this goal, existing approaches exploit either standalone …
Palette: Image-to-image diffusion models
This paper develops a unified framework for image-to-image translation based on
conditional diffusion models and evaluates this framework on four challenging image-to …
conditional diffusion models and evaluates this framework on four challenging image-to …
Resolution-robust large mask inpainting with fourier convolutions
Modern image inpainting systems, despite the significant progress, often struggle with large
missing areas, complex geometric structures, and high-resolution images. We find that one …
missing areas, complex geometric structures, and high-resolution images. We find that one …
Image inpainting via conditional texture and structure dual generation
Deep generative approaches have recently made considerable progress in image
inpainting by introducing structure priors. Due to the lack of proper interaction with image …
inpainting by introducing structure priors. Due to the lack of proper interaction with image …
Deep learning for image inpainting: A survey
Image inpainting has been widely exploited in the field of computer vision and image
processing. The main purpose of image inpainting is to produce visually plausible structure …
processing. The main purpose of image inpainting is to produce visually plausible structure …
Videomoco: Contrastive video representation learning with temporally adversarial examples
MoCo is effective for unsupervised image representation learning. In this paper, we propose
VideoMoCo for unsupervised video representation learning. Given a video sequence as an …
VideoMoCo for unsupervised video representation learning. Given a video sequence as an …
Pd-gan: Probabilistic diverse gan for image inpainting
We propose PD-GAN, a probabilistic diverse GAN forimage inpainting. Given an input image
with arbitrary holeregions, PD-GAN produces multiple inpainting results withdiverse and …
with arbitrary holeregions, PD-GAN produces multiple inpainting results withdiverse and …
High-fidelity pluralistic image completion with transformers
Image completion has made tremendous progress with convolutional neural networks
(CNNs), because of their powerful texture modeling capacity. However, due to some …
(CNNs), because of their powerful texture modeling capacity. However, due to some …