How to Merge Your Multimodal Models Over Time?
Model merging combines multiple expert models-finetuned from a base foundation model
on diverse tasks and domains-into a single, more capable model. However, most existing …
on diverse tasks and domains-into a single, more capable model. However, most existing …
Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging
Deep model merging represents an emerging research direction that combines multiple fine-
tuned models to harness their specialized capabilities across different tasks and domains …
tuned models to harness their specialized capabilities across different tasks and domains …
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
We introduce Reward-Guided Speculative Decoding (RSD), a novel framework aimed at
improving the efficiency of inference in large language models (LLMs). RSD synergistically …
improving the efficiency of inference in large language models (LLMs). RSD synergistically …
If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs
Model merging has shown great promise at combining expert models, but the benefit of
merging is unclear when merging``generalist''models trained on many tasks. We explore …
merging is unclear when merging``generalist''models trained on many tasks. We explore …
AstroMLab 3: Achieving GPT-4o Level Performance in Astronomy with a Specialized 8B-Parameter Large Language Model
AstroSage-Llama-3.1-8B is a domain-specialized natural-language AI assistant tailored for
research in astronomy, astrophysics, and cosmology. Trained on the complete collection of …
research in astronomy, astrophysics, and cosmology. Trained on the complete collection of …
Modular, Collaborative and Decentralized Deep Learning
The increasing complexity of modern machine learning models exposes the limitations of
the traditional, monolithic approach to their development, raising concerns about cost and …
the traditional, monolithic approach to their development, raising concerns about cost and …