A comprehensive overview of large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

A survey on large language model (LLM) security and privacy: The good, the bad, and the ugly

Y Yao, J Duan, K Xu, Y Cai, Z Sun, Y Zhang - High-Confidence Computing, 2024 - Elsevier
Large Language Models (LLMs), such as ChatGPT and Bard, have revolutionized
natural language understanding and generation. They possess deep language …

A survey of large language models

WX Zhao, K Zhou, J Li, T Tang, X Wang, Y Hou… - arXiv preprint arXiv …, 2023 - arxiv.org
Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …

DINOv2: Learning robust visual features without supervision

M Oquab, T Darcet, T Moutakanni, H Vo… - arXiv preprint arXiv …, 2023 - arxiv.org
The recent breakthroughs in natural language processing for model pretraining on large
quantities of data have opened the way for similar foundation models in computer vision …

The RefinedWeb dataset for Falcon LLM: outperforming curated corpora with web data, and web data only

G Penedo, Q Malartic, D Hesslow, R Cojocaru… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models are commonly trained on a mixture of filtered web data and curated
high-quality corpora, such as social media conversations, books, or technical papers. This …

Toolformer: Language models can teach themselves to use tools

T Schick, J Dwivedi-Yu, R Dessì… - Advances in …, 2023 - proceedings.neurips.cc
Language models (LMs) exhibit remarkable abilities to solve new tasks from just a
few examples or textual instructions, especially at scale. They also, paradoxically, struggle …

Yi: Open foundation models by 01.AI

A Young, B Chen, C Li, C Huang, G Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce the Yi model family, a series of language and multimodal models that
demonstrate strong multi-dimensional capabilities. The Yi model family is based on 6B and …

MM1: Methods, analysis and insights from multimodal LLM pre-training

B McKinzie, Z Gan, JP Fauconnier, S Dodge… - … on Computer Vision, 2024 - Springer
In this work, we discuss building performant Multimodal Large Language Models (MLLMs).
In particular, we study the importance of various architecture components and data choices …

Scaling data-constrained language models

N Muennighoff, A Rush, B Barak… - Advances in …, 2023 - proceedings.neurips.cc
The current trend of scaling language models involves increasing both parameter count and
training dataset size. Extrapolating this trend suggests that training dataset size may soon be …

GPT3.int8(): 8-bit matrix multiplication for transformers at scale

T Dettmers, M Lewis, Y Belkada… - Advances in Neural …, 2022 - proceedings.neurips.cc
Large language models have been widely adopted but require significant GPU memory for
inference. We develop a procedure for Int8 matrix multiplication for feed-forward and …