Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
As information exists in various modalities in the real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

Modulated contrast for versatile image synthesis

F Zhan, J Zhang, Y Yu, R Wu… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Perceiving the similarity between images has been a long-standing and fundamental
problem underlying various visual generation tasks. Predominant approaches measure the …

Video prediction by efficient transformers

X Ye, GA Bilodeau - Image and Vision Computing, 2023 - Elsevier
Video prediction is a challenging computer vision task that has a wide range of applications.
In this work, we present a new family of Transformer-based models for video prediction …

Towards counterfactual image manipulation via CLIP

Y Yu, F Zhan, R Wu, J Zhang, S Lu, M Cui… - Proceedings of the 30th …, 2022 - dl.acm.org
Leveraging StyleGAN's expressivity and its disentangled latent codes, existing methods can
achieve realistic editing of different visual attributes such as age and gender of facial …

Generating high-resolution synthetic CT from lung MRI with ultrashort echo times: initial evaluation in cystic fibrosis

A Longuefosse, J Raoult, I Benlala… - Radiology, 2023 - pubs.rsna.org
Background: Lung MRI with ultrashort echo times (UTEs) enables high-resolution and
radiation-free morphologic imaging; however, its image quality is still lower than that of CT …

TSNeRF: Text-driven stylized neural radiance fields via semantic contrastive learning

Y Wang, JS Cheng, Q Feng, WY Tao, YK Lai, K Li - Computers & Graphics, 2023 - Elsevier
3D scene stylization aims to generate impressive stylized images from arbitrary
novel views based on the stylistic reference. Existing image-driven 3D scene stylization …

Enhancement of guided thermal image super-resolution approaches

PL Suárez, D Carpio, AD Sappa - Neurocomputing, 2024 - Elsevier
Guided image processing techniques are widely used to extract meaningful information from
a guiding image and facilitate the enhancement of the guided one. This paper specifically …

VPTR: Efficient transformers for video prediction

X Ye, GA Bilodeau - 2022 26th International Conference on …, 2022 - ieeexplore.ieee.org
In this paper, we propose a new Transformer block for video future frames prediction based
on an efficient local spatial-temporal separation attention mechanism. Based on this new …

Criteria comparative learning for real-scene image super-resolution

Y Shi, H Li, S Zhang, Z Yang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Real-scene image super-resolution aims to restore real-world low-resolution images into
their high-quality versions. A typical RealSR framework usually includes the optimization of …

Contrastive distortion‐level learning‐based no‐reference image‐quality assessment

X Wei, J Li, M Zhou, X Wang - International Journal of …, 2022 - Wiley Online Library
A contrastive distortion‐level learning‐based no‐reference image‐quality assessment
(NR‐IQA) framework is proposed in this study to further effectively model various distortion types …