Pre-trained language models and their applications
Pre-trained language models have achieved striking success in natural language
processing (NLP), leading to a paradigm shift from supervised learning to pre-training …
Large-scale multi-modal pre-trained models: A comprehensive survey
With the urgent demand for generalized deep models, many pre-trained big models have been
proposed, such as bidirectional encoder representations from transformers (BERT), vision transformer (ViT) …
Qwen-VL: A versatile vision-language model for understanding, localization, text reading, and beyond
In this work, we introduce the Qwen-VL series, a set of large-scale vision-language models
(LVLMs) designed to perceive and understand both texts and images. Starting from the …
Uni-ControlNet: All-in-one control to text-to-image diffusion models
Text-to-Image diffusion models have made tremendous progress over the past two years,
enabling the generation of highly realistic images based on open-domain text descriptions …
Towards open-world recommendation with knowledge augmentation from large language models
Recommender systems play a vital role in various online services. However, their insulated
nature of being trained and deployed separately within a specific closed domain limits their access …
Using DeepSpeed and Megatron to train Megatron-Turing NLG 530B, a large-scale generative language model
Pretrained general-purpose language models can achieve state-of-the-art accuracies in
various natural language processing domains by adapting to downstream tasks via zero …
Vector quantized diffusion model for text-to-image synthesis
We present the vector quantized diffusion (VQ-Diffusion) model for text-to-image generation.
This method is based on a vector quantized variational autoencoder (VQ-VAE) whose latent …
FILIP: Fine-grained interactive language-image pre-training
Unsupervised large-scale vision-language pre-training has shown promising advances on
various downstream tasks. Existing methods often model the cross-modal interaction either …
A survey of transformers
Transformers have achieved great success in many artificial intelligence fields, such as
natural language processing, computer vision, and audio processing. Therefore, it is natural …