Transformers in vision: A survey

S Khan, M Naseer, M Hayat, SW Zamir… - ACM Computing …, 2022 - dl.acm.org
Astounding results from Transformer models on natural language tasks have intrigued the
vision community to study their application to computer vision problems. Among their salient …

Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

StyleCLIP: Text-driven manipulation of StyleGAN imagery

O Patashnik, Z Wu, E Shechtman… - Proceedings of the …, 2021 - openaccess.thecvf.com
Inspired by the ability of StyleGAN to generate highly realistic images in a variety of
domains, much recent work has focused on understanding how to use the latent spaces …

Understanding and creating art with AI: Review and outlook

E Cetinic, J She - ACM Transactions on Multimedia Computing …, 2022 - dl.acm.org
Technologies related to artificial intelligence (AI) have a strong impact on the changes of
research and creative practices in visual arts. The growing number of research initiatives …

Frozen pretrained transformers as universal computation engines

K Lu, A Grover, P Abbeel, I Mordatch - Proceedings of the AAAI …, 2022 - ojs.aaai.org
We investigate the capability of a transformer pretrained on natural language to generalize
to other modalities with minimal finetuning--in particular, without finetuning of the self …

Attention, please! A survey of neural attention models in deep learning

A de Santana Correia, EL Colombini - Artificial Intelligence Review, 2022 - Springer
In humans, Attention is a core property of all perceptual and cognitive operations. Given our
limited ability to process competing sources, attention mechanisms select, modulate, and …

Symbolic music generation with diffusion models

G Mittal, J Engel, C Hawthorne, I Simon - arXiv preprint arXiv:2103.16091, 2021 - arxiv.org
Score-based generative models and diffusion probabilistic models have been successful at
generating high-quality samples in continuous domains such as images and audio …

Generating images with sparse representations

C Nash, J Menick, S Dieleman, PW Battaglia - arXiv preprint arXiv …, 2021 - arxiv.org
The high dimensionality of images presents architecture and sampling-efficiency challenges
for likelihood-based generative models. Previous approaches such as VQ-VAE use deep …

How to Protect Copyright Data in Optimization of Large Language Models?

T Chu, Z Song, C Yang - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
The softmax operator is a crucial component of large language models (LLMs), which have
played a transformative role in computer research. Due to the centrality of the softmax …

Attention approximates sparse distributed memory

T Bricken, C Pehlevan - Advances in Neural Information …, 2021 - proceedings.neurips.cc
While Attention has come to be an important mechanism in deep learning, there remains
limited intuition for why it works so well. Here, we show that Transformer Attention can be …