Dae-gan: Dynamic aspect-aware gan for text-to-image synthesis

S Ruan, Y Zhang, K Zhang, Y Fan… - Proceedings of the …, 2021 - openaccess.thecvf.com
Text-to-image synthesis refers to generating an image from a given text description, the key
goal of which lies in photo realism and semantic consistency. Previous methods usually …

Neural architecture search with a lightweight transformer for text-to-image synthesis

W Li, S Wen, K Shi, Y Yang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Despite the cross-modal text-to-imagesynthesis task has achieved great success, most of
the latest works in this field are based on the network architectures proposed by …

Coutfitgan: learning to synthesize compatible outfits supervised by silhouette masks and fashion styles

D Zhou, H Zhang, Q Li, J Ma… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
How to recommend outfits has gained considerable attention in both academia and industry
in recent years. Many studies have been carried out regarding fashion compatibility …

Faceclip: Facial image-to-video translation via a brief text description

J Guo, H Manukyan, C Yang, C Wang… - … on Circuits and …, 2023 - ieeexplore.ieee.org
The existing image-to-video translation methods generally follow a frame-by-frame
generative paradigm, while extracting the temporal information from a reference video or an …

Dse-gan: Dynamic semantic evolution generative adversarial network for text-to-image generation

M Huang, Z Mao, P Wang, Q Wang… - Proceedings of the 30th …, 2022 - dl.acm.org
Text-to-image generation aims at generating realistic images which are semantically
consistent with the given text. Previous works mainly adopt the multi-stage architecture by …

CF-GAN: cross-domain feature fusion generative adversarial network for text-to-image synthesis

Y Zhang, S Han, Z Zhang, J Wang, H Bi - The Visual Computer, 2023 - Springer
In recent years, generative adversarial networks have successfully synthesized images
through text descriptions. However, there are still problems that the generated image cannot …

A review of multi-modal learning from the text-guided visual processing viewpoint

U Ullah, JS Lee, CH An, H Lee, SY Park, RH Baek… - Sensors, 2022 - mdpi.com
For decades, co-relating different data domains to attain the maximum potential of machines
has driven research, especially in neural networks. Similarly, text and visual data (images …

Vision-language matching for text-to-image synthesis via generative adversarial networks

Q Cheng, K Wen, X Gu - IEEE Transactions on Multimedia, 2022 - ieeexplore.ieee.org
Text-to-image synthesis is an attractive but challenging task that aims to generate a photo-
realistic and semantic consistent image from a specific text description. The images …

Vision+ language applications: A survey

Y Zhou, N Shimada - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Text-to-image generation has attracted significant interest from researchers and practitioners
in recent years due to its widespread and diverse applications across various industries …

Neural networks-based data hiding in digital images: overview

K Dzhanashia, O Evsutin - Neurocomputing, 2024 - Elsevier
Nowadays, neural networks are actively used for data hiding; however, there is currently no
systematic knowledge regarding their utilization in this field. This is a significant gap …