Enhancing LLM Capabilities Beyond Scaling Up

W Yin, M Chen, R Zhang, B Zhou… - Proceedings of the …, 2024 - aclanthology.org
General-purpose large language models (LLMs) are progressively expanding both in scale
and access to unpublic training data. This has led to notable progress in a variety of AI …

Evaluating and Advancing Multimodal Large Language Models in Ability Lens

F Chen, C Gou, J Liu, Y Yang, Z Li, J Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
As multimodal large language models (MLLMs) advance rapidly, rigorous evaluation has
become essential, providing further guidance for their development. In this work, we focus …

Modular, Collaborative and Decentralized Deep Learning

P Yadav, H Liu, W Zhao, A Douillard, M Ciccone… - ICLR 2025 Workshop … - openreview.net
The increasing complexity of modern machine learning models exposes the limitations of
the traditional, monolithic approach to their development, raising concerns about cost and …

LLM Merging Competition Technical Report: Efficient Model Merging with Strategic Model Selection, Merging, and Hyperparameter Optimization

Z Di, Y Yang, M Jiang, B Li, H Qian, A Zhou - LLM Merging Competition at … - openreview.net
The LLM Merging Competition in NeurIPS'24 aims to build LLMs efficiently through model
merging, which enables the combination of multiple specialized fine-tuned models into a …