- Academic Search

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arxiv preprint arxiv …, 2023‏ - arxiv.org‏

Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …‏

שמור צטט צוטט על ידי 792 מאמרים בנושא זה כל 4 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Understanding llms: A comprehensive overview from training to inference‏

Y Liu, H He, T Han, X Zhang, M Liu, J Tian, Y Zhang… - Neurocomputing, 2024‏ - Elsevier‏

The introduction of ChatGPT has led to a significant increase in the utilization of Large
Language Models (LLMs) for addressing downstream tasks. There's an increasing focus on …‏

שמור צטט צוטט על ידי 86 מאמרים בנושא זה כל 6 הגרסאות

[Free GPT-4]
[DeepSeek]

[PDF] zhjwpku.com

[PDF][PDF] A survey of large language models‏

WX Zhao, K Zhou, J Li, T Tang… - arxiv preprint arxiv …, 2023‏ - paper-notes.zhjwpku.com‏

Ever since the Turing Test was proposed in the 1950s, humans have explored the mastering
of language intelligence by machine. Language is essentially a complex, intricate system of …‏

שמור צטט צוטט על ידי 3838 מאמרים בנושא זה כל 6 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

The llama 3 herd of models‏

A Dubey, A Jauhri, A Pandey, A Kadian… - arxiv preprint arxiv …, 2024‏ - arxiv.org‏

Modern artificial intelligence (AI) systems are powered by foundation models. This paper
presents a new set of foundation models, called Llama 3. It is a herd of language models …‏

שמור צטט צוטט על ידי 3098 מאמרים בנושא זה כל 4 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Transformers are ssms: Generalized models and efficient algorithms through structured state space duality‏

T Dao, A Gu - arxiv preprint arxiv:2405.21060, 2024‏ - arxiv.org‏

While Transformers have been the main architecture behind deep learning's success in
language modeling, state-space models (SSMs) such as Mamba have recently been shown …‏

שמור צטט צוטט על ידי 350 מאמרים בנושא זה כל 6 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Deepseek-vl: towards real-world vision-language understanding‏

H Lu, W Liu, B Zhang, B Wang, K Dong, B Liu… - arxiv preprint arxiv …, 2024‏ - arxiv.org‏

We present DeepSeek-VL, an open-source Vision-Language (VL) Model designed for real-
world vision and language understanding applications. Our approach is structured around …‏

שמור צטט צוטט על ידי 229 מאמרים בנושא זה כל 4 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Deepseek llm: Scaling open-source language models with longtermism‏

X Bi, D Chen, G Chen, S Chen, D Dai, C Deng… - arxiv preprint arxiv …, 2024‏ - arxiv.org‏

The rapid development of open-source large language models (LLMs) has been truly
remarkable. However, the scaling law described in previous literature presents varying …‏

שמור צטט צוטט על ידי 235 מאמרים בנושא זה כל 4 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Meditron-70b: Scaling medical pretraining for large language models‏

Z Chen, AH Cano, A Romanou, A Bonnet… - arxiv preprint arxiv …, 2023‏ - arxiv.org‏

Large language models (LLMs) can potentially democratize access to medical knowledge.
While many efforts have been made to harness and improve LLMs' medical knowledge and …‏

שמור צטט צוטט על ידי 269 מאמרים בנושא זה כל 3 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlsys.org

Efficiently scaling transformer inference‏

R Pope, S Douglas, A Chowdhery… - Proceedings of …, 2023‏ - proceedings.mlsys.org‏

We study the problem of efficient generative inference for Transformer models, in one of its
most challenging settings: large deep models, with tight latency targets and long sequence …‏

שמור צטט צוטט על ידי 368 מאמרים בנושא זה כל 7 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] A survey of large language models for healthcare: from data, technology, and applications to accountability and ethics‏

K He, R Mao, Q Lin, Y Ruan, X Lan, M Feng… - Information …, 2025‏ - Elsevier‏

The utilization of large language models (LLMs) for Healthcare has generated both
excitement and concern due to their ability to effectively respond to free-text queries with …‏

שמור צטט צוטט על ידי 153 מאמרים בנושא זה כל 5 הגרסאות

יצירת התראה

צטט

חיפוש מתקדם

נשמר בספרייה שלי

Reducing activation recomputation in large transformer models

A comprehensive overview of large language models‏

Understanding llms: A comprehensive overview from training to inference‏

[PDF][PDF] A survey of large language models‏

The llama 3 herd of models‏

Transformers are ssms: Generalized models and efficient algorithms through structured state space duality‏

Deepseek-vl: towards real-world vision-language understanding‏

Deepseek llm: Scaling open-source language models with longtermism‏

Meditron-70b: Scaling medical pretraining for large language models‏

Efficiently scaling transformer inference‏

[HTML][HTML] A survey of large language models for healthcare: from data, technology, and applications to accountability and ethics‏