محقق Google

Z **ng, Q Feng, H Chen, Q Dai, H Hu, H Xu… - ACM Computing …, 2024‏ - dl.acm.org‏

The recent wave of AI-generated content (AIGC) has witnessed substantial success in
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …‏

ذخیره ارجاع بیان شده در 98 یافته مقاله‌های مربوط تمام نسخه‌های 4

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A complete survey on generative ai (aigc): Is chatgpt from gpt-4 to gpt-5 all you need?‏

C Zhang, C Zhang, S Zheng, Y Qiao, C Li… - arxiv preprint arxiv …, 2023‏ - arxiv.org‏

As ChatGPT goes viral, generative AI (AIGC, aka AI-generated content) has made headlines
everywhere because of its ability to analyze and create text, images, and beyond. With such …‏

ذخیره ارجاع بیان شده در 210 یافته مقاله‌های مربوط تمام نسخه‌های 4 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Identifying and mitigating vulnerabilities in llm-integrated applications‏

F Jiang - 2024‏ - search.proquest.com‏

Large language models (LLMs) are increasingly deployed as the backend for various
applications, including code completion tools and AI-powered search engines. Unlike …‏

ذخیره ارجاع بیان شده در 1503 یافته مقاله‌های مربوط تمام نسخه‌های 8

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Align your latents: High-resolution video synthesis with latent diffusion models‏

A Blattmann, R Rombach, H Ling… - Proceedings of the …, 2023‏ - openaccess.thecvf.com‏

Abstract Latent Diffusion Models (LDMs) enable high-quality image synthesis while avoiding
excessive compute demands by training a diffusion model in a compressed lower …‏

ذخیره ارجاع بیان شده در 963 یافته مقاله‌های مربوط تمام نسخه‌های 7 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] openreview.net

Next-gpt: Any-to-any multimodal llm‏

S Wu, H Fei, L Qu, W Ji, TS Chua - Forty-first International …, 2024‏ - openreview.net‏

While recently Multimodal Large Language Models (MM-LLMs) have made exciting strides,
they mostly fall prey to the limitation of only input-side multimodal understanding, without the …‏

ذخیره ارجاع بیان شده در 512 یافته مقاله‌های مربوط تمام نسخه‌های 6 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Text2video-zero: Text-to-image diffusion models are zero-shot video generators‏

L Khachatryan, A Movsisyan… - Proceedings of the …, 2023‏ - openaccess.thecvf.com‏

Recent text-to-video generation approaches rely on computationally heavy training and
require large-scale video datasets. In this paper, we introduce a new task, zero-shot text-to …‏

ذخیره ارجاع بیان شده در 492 یافته مقاله‌های مربوط تمام نسخه‌های 8 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Blip-diffusion: Pre-trained subject representation for controllable text-to-image generation and editing‏

D Li, J Li, S Hoi - Advances in Neural Information …, 2023‏ - proceedings.neurips.cc‏

Subject-driven text-to-image generation models create novel renditions of an input subject
based on text prompts. Existing models suffer from lengthy fine-tuning and difficulties …‏

ذخیره ارجاع بیان شده در 246 یافته مقاله‌های مربوط تمام نسخه‌های 5 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] nowpublishers.com

Multimodal foundation models: From specialists to general-purpose assistants‏

C Li, Z Gan, Z Yang, J Yang, L Li… - … and Trends® in …, 2024‏ - nowpublishers.com‏

Neural compression is the application of neural networks and other machine learning
methods to data compression. Recent advances in statistical machine learning have opened …‏

ذخیره ارجاع بیان شده در 231 یافته مقاله‌های مربوط تمام نسخه‌های 7 Library Search‏ نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Emergent correspondence from image diffusion‏

L Tang, M Jia, Q Wang, CP Phoo… - Advances in Neural …, 2023‏ - proceedings.neurips.cc‏

Finding correspondences between images is a fundamental problem in computer vision. In
this paper, we show that correspondence emerges in image diffusion models without any …‏

ذخیره ارجاع بیان شده در 314 یافته مقاله‌های مربوط تمام نسخه‌های 12 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

T2i-adapter: Learning adapters to dig out more controllable ability for text-to-image diffusion models‏

C Mou, X Wang, L **e, Y Wu, J Zhang, Z Qi… - Proceedings of the AAAI …, 2024‏ - ojs.aaai.org‏

The incredible generative ability of large-scale text-to-image (T2I) models has demonstrated
strong power of learning complex structures and meaningful semantics. However, relying …‏

ذخیره ارجاع بیان شده در 872 یافته مقاله‌های مربوط تمام نسخه‌های 6 نسخه HTML

ایجاد هشدار

ارجاع

جستجوی پیشرفته

در «کتابخانه من» ذخیره شد

Prompt-to-prompt image editing with cross attention control

A survey on video diffusion models‏

A complete survey on generative ai (aigc): Is chatgpt from gpt-4 to gpt-5 all you need?‏

Identifying and mitigating vulnerabilities in llm-integrated applications‏

Align your latents: High-resolution video synthesis with latent diffusion models‏

Next-gpt: Any-to-any multimodal llm‏

Text2video-zero: Text-to-image diffusion models are zero-shot video generators‏

Blip-diffusion: Pre-trained subject representation for controllable text-to-image generation and editing‏

Multimodal foundation models: From specialists to general-purpose assistants‏

Emergent correspondence from image diffusion‏

T2i-adapter: Learning adapters to dig out more controllable ability for text-to-image diffusion models‏