Make-an-audio: Text-to-audio generation with prompt-enhanced diffusion models

R Huang, J Huang, D Yang, Y Ren… - International …, 2023 - proceedings.mlr.press
Large-scale multimodal generative modeling has created milestones in text-to-image and
text-to-video generation. Its application to audio still lags behind for two main reasons: the …

Repaint: Inpainting using denoising diffusion probabilistic models

A Lugmayr, M Danelljan, A Romero… - Proceedings of the …, 2022 - openaccess.thecvf.com
Free-form inpainting is the task of adding new content to an image in the regions specified
by an arbitrary binary mask. Most existing approaches train for a certain distribution of …

Mat: Mask-aware transformer for large hole image inpainting

W Li, Z Lin, K Zhou, L Qi, Y Wang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Recent studies have shown the importance of modeling long-range interactions in the
inpainting problem. To achieve this goal, existing approaches exploit either standalone …

Palette: Image-to-image diffusion models

C Saharia, W Chan, H Chang, C Lee, J Ho… - ACM SIGGRAPH 2022 …, 2022 - dl.acm.org
This paper develops a unified framework for image-to-image translation based on
conditional diffusion models and evaluates this framework on four challenging image-to …

Resolution-robust large mask inpainting with fourier convolutions

R Suvorov, E Logacheva, A Mashikhin… - Proceedings of the …, 2022 - openaccess.thecvf.com
Modern image inpainting systems, despite the significant progress, often struggle with large
missing areas, complex geometric structures, and high-resolution images. We find that one …

Image inpainting via conditional texture and structure dual generation

X Guo, H Yang, D Huang - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Deep generative approaches have recently made considerable progress in image
inpainting by introducing structure priors. Due to the lack of proper interaction with image …

Deep learning for image inpainting: A survey

H **ang, Q Zou, MA Nawaz, X Huang, F Zhang, H Yu - Pattern Recognition, 2023 - Elsevier
Image inpainting has been widely exploited in the field of computer vision and image
processing. The main purpose of image inpainting is to produce visually plausible structure …

Videomoco: Contrastive video representation learning with temporally adversarial examples

T Pan, Y Song, T Yang, W Jiang… - Proceedings of the …, 2021 - openaccess.thecvf.com
MoCo is effective for unsupervised image representation learning. In this paper, we propose
VideoMoCo for unsupervised video representation learning. Given a video sequence as an …

Pd-gan: Probabilistic diverse gan for image inpainting

H Liu, Z Wan, W Huang, Y Song… - Proceedings of the …, 2021 - openaccess.thecvf.com
We propose PD-GAN, a probabilistic diverse GAN forimage inpainting. Given an input image
with arbitrary holeregions, PD-GAN produces multiple inpainting results withdiverse and …

High-fidelity pluralistic image completion with transformers

Z Wan, J Zhang, D Chen, J Liao - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Image completion has made tremendous progress with convolutional neural networks
(CNNs), because of their powerful texture modeling capacity. However, due to some …