A survey on federated learning for resource-constrained IoT devices
Federated learning (FL) is a distributed machine learning strategy that generates a global
model by learning from multiple decentralized edge clients. FL enables on-device training …
Demystifying parallel and distributed deep learning: An in-depth concurrency analysis
Deep Neural Networks (DNNs) are becoming an important tool in modern computing
applications. Accelerating their training is a major challenge and techniques range from …
QLoRA: Efficient finetuning of quantized LLMs
We present QLoRA, an efficient finetuning approach that reduces memory usage enough to
finetune a 65B parameter model on a single 48GB GPU while preserving full 16-bit …
A survey of large language models
Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …
Efficient memory management for large language model serving with PagedAttention
High throughput serving of large language models (LLMs) requires batching sufficiently
many requests at a time. However, existing systems struggle because the key-value cache …
Learning skillful medium-range global weather forecasting
Global medium-range weather forecasting is critical to decision-making across many social
and economic domains. Traditional numerical weather prediction uses increased compute …
LlamaFactory: Unified efficient fine-tuning of 100+ language models
Efficient fine-tuning is vital for adapting large language models (LLMs) to downstream tasks.
However, it requires non-trivial efforts to implement these methods on different models. We …
EVA: Exploring the limits of masked visual representation learning at scale
We launch EVA, a vision-centric foundation model to explore the limits of visual
representation at scale using only publicly accessible data. EVA is a vanilla ViT pre-trained …
Scaling language-image pre-training via masking
We present Fast Language-Image Pre-training (FLIP), a simple and more efficient
method for training CLIP. Our method randomly masks out and removes a large portion of …
LightGlue: Local feature matching at light speed
We introduce LightGlue, a deep neural network that learns to match local features across
images. We revisit multiple design decisions of SuperGlue, the state of the art in sparse …