- Academic Search

About 783 results (0.03 sec)

Multimodal few-shot learning with frozen language models

[PDF] github.io

The rise and potential of large language model based agents: A survey

BLIP-2: Bootstrapping language-image pre-training with frozen image encoders and large language models

J Li, D Li, S Savarese, S Hoi - International conference on …, 2023 - proceedings.mlr.press

The cost of vision-and-language pre-training has become increasingly prohibitive due to
end-to-end training of large-scale models. This paper proposes BLIP-2, a generic and …

Cited by: 4875. All 7 versions.

[PDF] arxiv.org

MM1: methods, analysis and insights from multimodal LLM pre-training

B McKinzie, Z Gan, JP Fauconnier, S Dodge… - … on Computer Vision, 2024 - Springer

In this work, we discuss building performant Multimodal Large Language Models (MLLMs).
In particular, we study the importance of various architecture components and data choices …

Cited by: 187. All 2 versions.
