Attention mechanism in neural networks: where it comes and where it goes
D Soydaner - Neural Computing and Applications, 2022 - Springer
A long time ago in the machine learning literature, the idea of incorporating a mechanism
inspired by the human visual system into neural networks was introduced. This idea is …
A review on generative adversarial networks: Algorithms, theory, and applications
Generative adversarial networks (GANs) have recently become a hot research topic;
however, they have been studied since 2014, and a large number of algorithms have been …
Text2video-zero: Text-to-image diffusion models are zero-shot video generators
L Khachatryan, A Movsisyan… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent text-to-video generation approaches rely on computationally heavy training and
require large-scale video datasets. In this paper, we introduce a new task, zero-shot text-to …
Scaling up gans for text-to-image synthesis
The recent success of text-to-image synthesis has taken the world by storm and captured the
general public's imagination. From a technical standpoint, it also marked a drastic change in …
A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications
Data scarcity is a major challenge when training deep learning (DL) models. DL demands a
large amount of data to achieve exceptional performance. Unfortunately, many applications …
T2i-compbench: A comprehensive benchmark for open-world compositional text-to-image generation
Despite the stunning ability to generate high-quality images by recent text-to-image models,
current approaches often struggle to effectively compose objects with different attributes and …
Stylegan-t: Unlocking the power of gans for fast large-scale text-to-image synthesis
Text-to-image synthesis has recently seen significant progress thanks to large pretrained
language models, large-scale training data, and the introduction of scalable model families …
Scaling autoregressive models for content-rich text-to-image generation
We present the Pathways [1] Autoregressive Text-to-Image (Parti) model, which
generates high-fidelity photorealistic images and supports content-rich synthesis involving …
Instaflow: One step is enough for high-quality diffusion-based text-to-image generation
Diffusion models have revolutionized text-to-image generation with its exceptional quality
and creativity. However, its multi-step sampling process is known to be slow, often requiring …
Layoutdiffusion: Controllable diffusion model for layout-to-image generation
Recently, diffusion models have achieved great success in image synthesis. However, when
it comes to the layout-to-image generation where an image often has a complex scene of …