- Academic Search

W Yin, M Chen, R Zhang, B Zhou… - Proceedings of the …, 2024 - aclanthology.org

General-purpose large language models (LLMs) are progressively expanding both in scale
and access to unpublic training data. This has led to notable progress in a variety of AI …

Zapisz Cytuj Cytowane przez 2 Powiązane artykuły Wszystkie wersje 3 Wersja HTML

[Free GPT-4]

[PDF] arxiv.org

Evaluating and Advancing Multimodal Large Language Models in Ability Lens

F Chen, C Gou, J Liu, Y Yang, Z Li, J Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org

As multimodal large language models (MLLMs) advance rapidly, rigorous evaluation has
become essential, providing further guidance for their development. In this work, we focus …

Zapisz Cytuj Powiązane artykuły Wszystkie wersje 2 Wersja HTML

[Free GPT-4]

[PDF] openreview.net

Modular, Collaborative and Decentralized Deep Learning

P Yadav, H Liu, W Zhao, A Douillard, M Ciccone… - ICLR 2025 Workshop … - openreview.net

The increasing complexity of modern machine learning models exposes the limitations of
the traditional, monolithic approach to their development, raising concerns about cost and …

Zapisz Cytuj Powiązane artykuły Wersja HTML

[Free GPT-4]

[PDF] openreview.net

LLM Merging Competition Technical Report: Efficient Model Merging with Strategic Model Selection, Merging, and Hyperparameter Optimization

Z Di, Y Yang, M Jiang, B Li, H Qian, A Zhou - LLM Merging Competition at … - openreview.net

The LLM Merging Competition in NeurIPS'24 aims to build LLMs efficiently through model
merging, which enables the combination of multiple specialized fine-tuned models into a …

Zapisz Cytuj Powiązane artykuły Wersja HTML

Utwórz alert

Cytuj

Szukanie zaawansowane

Zapisano w Mojej bibliotece

Llm merging: Building llms efficiently through merging

Enhancing LLM Capabilities Beyond Scaling Up

Evaluating and Advancing Multimodal Large Language Models in Ability Lens

Modular, Collaborative and Decentralized Deep Learning

LLM Merging Competition Technical Report: Efficient Model Merging with Strategic Model Selection, Merging, and Hyperparameter Optimization