Pre-trained language models and their applications

H Wang, J Li, H Wu, E Hovy, Y Sun - Engineering, 2023 - Elsevier
Pre-trained language models have achieved striking success in natural language
processing (NLP), leading to a paradigm shift from supervised learning to pre-training …

Large-scale multi-modal pre-trained models: A comprehensive survey

X Wang, G Chen, G Qian, P Gao, XY Wei… - Machine Intelligence …, 2023 - Springer
With the urgent demand for generalized deep models, many pre-trained big models have
been proposed, such as bidirectional encoder representations (BERT), vision transformer (ViT) …

Qwen technical report

J Bai, S Bai, Y Chu, Z Cui, K Dang, X Deng… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have revolutionized the field of artificial intelligence,
enabling natural language processing tasks that were previously thought to be exclusive to …

Uni-ControlNet: All-in-one control to text-to-image diffusion models

S Zhao, D Chen, YC Chen, J Bao… - Advances in …, 2023 - proceedings.neurips.cc
Text-to-Image diffusion models have made tremendous progress over the past two years,
enabling the generation of highly realistic images based on open-domain text descriptions …

DeepSeekMoE: Towards ultimate expert specialization in mixture-of-experts language models

D Dai, C Deng, C Zhao, RX Xu, H Gao, D Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
In the era of large language models, Mixture-of-Experts (MoE) is a promising architecture for
managing computational costs when scaling up model parameters. However, conventional …

GALIP: Generative adversarial CLIPs for text-to-image synthesis

M Tao, BK Bao, H Tang, C Xu - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Synthesizing high-fidelity complex images from text is challenging. Building on large-scale
pretraining, autoregressive and diffusion models can synthesize photo-realistic images …

Towards open-world recommendation with knowledge augmentation from large language models

Y Xi, W Liu, J Lin, X Cai, H Zhu, J Zhu, B Chen… - Proceedings of the 18th …, 2024 - dl.acm.org
Recommender systems play a vital role in various online services. However, training and
deploying them in isolation within a specific closed domain limits their access …

Using DeepSpeed and Megatron to train Megatron-Turing NLG 530B, a large-scale generative language model

S Smith, M Patwary, B Norick, P LeGresley… - arXiv preprint arXiv …, 2022 - arxiv.org
Pretrained general-purpose language models can achieve state-of-the-art accuracies in
various natural language processing domains by adapting to downstream tasks via zero …

Vector quantized diffusion model for text-to-image synthesis

S Gu, D Chen, J Bao, F Wen, B Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com
We present the vector quantized diffusion (VQ-Diffusion) model for text-to-image generation.
This method is based on a vector quantized variational autoencoder (VQ-VAE) whose latent …

FILIP: Fine-grained interactive language-image pre-training

L Yao, R Huang, L Hou, G Lu, M Niu, H Xu… - arXiv preprint arXiv …, 2021 - arxiv.org
Unsupervised large-scale vision-language pre-training has shown promising advances on
various downstream tasks. Existing methods often model the cross-modal interaction either …