On evaluating adversarial robustness of large vision-language models
Large vision-language models (VLMs) such as GPT-4 have achieved unprecedented
performance in response generation, especially with visual inputs, enabling more creative …
Text-to-image diffusion models in generative AI: A survey
This survey reviews text-to-image diffusion models in the context that diffusion models have
emerged to be popular for a wide range of generative tasks. As a self-contained work, this …
One transformer fits all distributions in multi-modal diffusion at scale
This paper proposes a unified diffusion framework (dubbed UniDiffuser) to fit all distributions
relevant to a set of multi-modal data in one model. Our key insight is: learning diffusion …
Show-o: One single transformer to unify multimodal understanding and generation
We present a unified transformer, i.e., Show-o, that unifies multimodal understanding and
generation. Unlike fully autoregressive models, Show-o unifies autoregressive and …
Online clustered codebook
Vector Quantisation (VQ) is experiencing a comeback in machine learning, where it is
increasingly used in representation learning. However, optimizing the codevectors in …
A reparameterized discrete diffusion model for text generation
This work studies discrete diffusion probabilistic models with applications to natural
language generation. We derive an alternative yet equivalent formulation of the sampling …
Diffusion models for non-autoregressive text generation: A survey
Non-autoregressive (NAR) text generation has attracted much attention in the field of natural
language processing, which greatly reduces the inference latency but has to sacrifice the …
DiffuSeq-v2: Bridging discrete and continuous text spaces for accelerated seq2seq diffusion models
Diffusion models have gained prominence in generating high-quality sequences of text.
Nevertheless, current approaches predominantly represent discrete text within a continuous …
What does Stable Diffusion know about the 3D scene?
Recent advances in generative models like Stable Diffusion enable the generation of highly
photo-realistic images. Our objective in this paper is to probe the diffusion network to …
Cocktail: Mixing multi-modality control for text-conditional image generation
Text-conditional diffusion models are able to generate high-fidelity images with diverse
contents. However, linguistic representations frequently exhibit ambiguous descriptions of …