Personal LLM agents: Insights and survey about the capability, efficiency and security

Y Li, H Wen, W Wang, X Li, Y Yuan, G Liu, J Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Since the advent of personal computing devices, intelligent personal assistants (IPAs) have
been one of the key technologies that researchers and engineers have focused on, aiming …

Splitwise: Efficient generative LLM inference using phase splitting

P Patel, E Choukse, C Zhang, A Shah… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Generative large language model (LLM) applications are growing rapidly, leading to large-
scale deployments of expensive and power-hungry GPUs. Our characterization of LLM …

A survey on large language models for code generation

J Jiang, F Wang, J Shen, S Kim, S Kim - arXiv preprint arXiv:2406.00515, 2024 - arxiv.org
Large Language Models (LLMs), known as Code LLMs in this setting, have achieved remarkable advancements across diverse
code-related tasks, particularly in code generation that generates …

The unreasonable ineffectiveness of the deeper layers

A Gromov, K Tirumala, H Shapourian… - arXiv preprint arXiv …, 2024 - arxiv.org
We empirically study a simple layer-pruning strategy for popular families of open-weight
pretrained LLMs, finding minimal degradation of performance on different question …

LLaVA-PruMerge: Adaptive token reduction for efficient large multimodal models

Y Shang, M Cai, B Xu, YJ Lee, Y Yan - arXiv preprint arXiv:2403.15388, 2024 - arxiv.org
Large Multimodal Models (LMMs) have shown significant visual reasoning capabilities by
connecting a visual encoder and a large language model. LMMs typically take in a fixed and …

TinyLLaVA: A framework of small-scale large multimodal models

B Zhou, Y Hu, X Weng, J Jia, J Luo, X Liu, J Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
We present the TinyLLaVA framework, which provides a unified perspective for designing and
analyzing small-scale Large Multimodal Models (LMMs). We empirically study the effects …