Gemma 2: Improving open language models at a practical size

G Team, M Riviere, S Pathak, PG Sessa… - arXiv preprint arXiv …, 2024 - arxiv.org
In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-
of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new …

Security and privacy challenges of large language models: A survey

BC Das, MH Amini, Y Wu - ACM Computing Surveys, 2024 - dl.acm.org
Large language models (LLMs) have demonstrated extraordinary capabilities and
contributed to multiple fields, such as generating and summarizing text, language …

Toward expert-level medical question answering with large language models

K Singhal, T Tu, J Gottweis, R Sayres, E Wulczyn… - Nature Medicine, 2025 - nature.com
Large language models (LLMs) have shown promise in medical question answering, with
Med-PaLM being the first to exceed a 'passing' score in United States Medical Licensing …

How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites

Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui… - Science China …, 2024 - Springer
In this paper, we introduce InternVL 1.5, an open-source multimodal large language model
(MLLM) to bridge the capability gap between open-source and proprietary commercial …

MM1: methods, analysis and insights from multimodal LLM pre-training

B McKinzie, Z Gan, JP Fauconnier, S Dodge… - … on Computer Vision, 2024 - Springer
In this work, we discuss building performant Multimodal Large Language Models (MLLMs).
In particular, we study the importance of various architecture components and data choices …

A survey on RAG meeting LLMs: Towards retrieval-augmented large language models

W Fan, Y Ding, L Ning, S Wang, H Li, D Yin… - Proceedings of the 30th …, 2024 - dl.acm.org
As one of the most advanced techniques in AI, Retrieval-Augmented Generation (RAG) can
offer reliable and up-to-date external knowledge, providing huge convenience for numerous …

MiniCPM-V: A GPT-4V level MLLM on your phone

Y Yao, T Yu, A Zhang, C Wang, J Cui, H Zhu… - arXiv preprint arXiv …, 2024 - arxiv.org
The recent surge of Multimodal Large Language Models (MLLMs) has fundamentally
reshaped the landscape of AI research and industry, shedding light on a promising path …

LLaVA-UHD: an LMM perceiving any aspect ratio and high-resolution images

Z Guo, R Xu, Y Yao, J Cui, Z Ni, C Ge, TS Chua… - … on Computer Vision, 2024 - Springer
Visual encoding constitutes the basis of large multimodal models (LMMs) in understanding
the visual world. Conventional LMMs process images in fixed sizes and limited resolutions …

ORPO: Monolithic preference optimization without reference model

J Hong, N Lee, J Thorne - … of the 2024 Conference on Empirical …, 2024 - aclanthology.org
While recent preference alignment algorithms for language models have demonstrated
promising results, supervised fine-tuning (SFT) remains imperative for achieving successful …

Large language models for generative information extraction: A survey

D Xu, W Chen, W Peng, C Zhang, T Xu, X Zhao… - Frontiers of Computer …, 2024 - Springer
Information Extraction (IE) aims to extract structural knowledge from plain natural
language texts. Recently, generative Large Language Models (LLMs) have demonstrated …