الباحث العلمي من Google

J Kaddour, J Harris, M Mozes, H Bradley… - arxiv preprint arxiv …, 2023‏ - arxiv.org‏

Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …‏

حفظ اقتباس تم اقتباسها في عدد: 484 مقالات ذات صلة الإصدارات الـ 4كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] A survey of GPT-3 family large language models including ChatGPT and GPT-4‏

KS Kalyan - Natural Language Processing Journal, 2024‏ - Elsevier‏

Large language models (LLMs) are a special class of pretrained language models (PLMs)
obtained by scaling model size, pretraining corpus and computation. LLMs, because of their …‏

حفظ اقتباس تم اقتباسها في عدد: 265 مقالات ذات صلة الإصدارات الـ 4كلها

[Free GPT-4]
[DeepSeek]

[PDF] openreview.net

Next-gpt: Any-to-any multimodal llm‏

S Wu, H Fei, L Qu, W Ji, TS Chua - Forty-first International …, 2024‏ - openreview.net‏

While recently Multimodal Large Language Models (MM-LLMs) have made exciting strides,
they mostly fall prey to the limitation of only input-side multimodal understanding, without the …‏

حفظ اقتباس تم اقتباسها في عدد: 498 مقالات ذات صلة الإصدارات الـ 6كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] nowpublishers.com

Multimodal foundation models: From specialists to general-purpose assistants‏

C Li, Z Gan, Z Yang, J Yang, L Li… - … and Trends® in …, 2024‏ - nowpublishers.com‏

Neural compression is the application of neural networks and other machine learning
methods to data compression. Recent advances in statistical machine learning have opened …‏

حفظ اقتباس تم اقتباسها في عدد: 228 مقالات ذات صلة الإصدارات الـ 7كلها بحث عن المكتبات إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Boxdiff: Text-to-image synthesis with training-free box-constrained diffusion‏

J **e, Y Li, Y Huang, H Liu, W Zhang… - Proceedings of the …, 2023‏ - openaccess.thecvf.com‏

Recent text-to-image diffusion models have demonstrated an astonishing capacity to
generate high-quality images. However, researchers mainly studied the way of synthesizing …‏

حفظ اقتباس تم اقتباسها في عدد: 165 مقالات ذات صلة الإصدارات الـ 9كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] openreview.net

Mastering text-to-image diffusion: Recaptioning, planning, and generating with multimodal llms‏

L Yang, Z Yu, C Meng, M Xu, S Ermon… - Forty-first International …, 2024‏ - openreview.net‏

Diffusion models have exhibit exceptional performance in text-to-image generation and
editing. However, existing methods often face challenges when handling complex text …‏

حفظ اقتباس تم اقتباسها في عدد: 90 مقالات ذات صلة الإصدارات الـ 6كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Instancediffusion: Instance-level control for image generation‏

X Wang, T Darrell, SS Rambhatla… - Proceedings of the …, 2024‏ - openaccess.thecvf.com‏

Text-to-image diffusion models produce high quality images but do not offer control over
individual instances in the image. We introduce InstanceDiffusion that adds precise instance …‏

حفظ اقتباس تم اقتباسها في عدد: 61 مقالات ذات صلة الإصدارات الـ 6كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Grounded text-to-image synthesis with attention refocusing‏

Q Phung, S Ge, JB Huang - … of the IEEE/CVF Conference on …, 2024‏ - openaccess.thecvf.com‏

Driven by the scalable diffusion models trained on large-scale datasets text-to-image
synthesis methods have shown compelling results. However these models still fail to …‏

حفظ اقتباس تم اقتباسها في عدد: 88 مقالات ذات صلة الإصدارات الـ 5كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Llmscore: Unveiling the power of large language models in text-to-image synthesis evaluation‏

Y Lu, X Yang, X Li, XE Wang… - Advances in Neural …, 2023‏ - proceedings.neurips.cc‏

Existing automatic evaluation on text-to-image synthesis can only provide an image-text
matching score, without considering the object-level compositionality, which results in poor …‏

حفظ اقتباس تم اقتباسها في عدد: 71 مقالات ذات صلة الإصدارات الـ 7كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Guiding instruction-based image editing via multimodal large language models‏

TJ Fu, W Hu, X Du, WY Wang, Y Yang… - arxiv preprint arxiv …, 2023‏ - arxiv.org‏

Instruction-based image editing improves the controllability and flexibility of image
manipulation via natural commands without elaborate descriptions or regional masks …‏

حفظ اقتباس تم اقتباسها في عدد: 92 مقالات ذات صلة الإصدارات الـ 6كلها إصدار HTML‏

إنشاء تنبيه

اقتباس

بحث متقدم

تم حفظ المقالة في مكتبتي.

Layoutgpt: Compositional visual planning and generation with large language models

Challenges and applications of large language models‏

[HTML][HTML] A survey of GPT-3 family large language models including ChatGPT and GPT-4‏

Next-gpt: Any-to-any multimodal llm‏

Multimodal foundation models: From specialists to general-purpose assistants‏

Boxdiff: Text-to-image synthesis with training-free box-constrained diffusion‏

Mastering text-to-image diffusion: Recaptioning, planning, and generating with multimodal llms‏

Instancediffusion: Instance-level control for image generation‏

Grounded text-to-image synthesis with attention refocusing‏

Llmscore: Unveiling the power of large language models in text-to-image synthesis evaluation‏

Guiding instruction-based image editing via multimodal large language models‏