How to Merge Your Multimodal Models Over Time?

S Dziadzio, V Udandarao, K Roth, A Prabhu… - arXiv preprint arXiv …, 2024 - arxiv.org
Model merging combines multiple expert models, finetuned from a base foundation model
on diverse tasks and domains, into a single, more capable model. However, most existing …

Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging

A Tang, E Yang, L Shen, Y Luo, H Hu, B Du… - arXiv preprint arXiv …, 2025 - arxiv.org
Deep model merging represents an emerging research direction that combines multiple fine-
tuned models to harness their specialized capabilities across different tasks and domains …

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

B Liao, Y Xu, H Dong, J Li, C Monz, S Savarese… - arXiv preprint arXiv …, 2025 - arxiv.org
We introduce Reward-Guided Speculative Decoding (RSD), a novel framework aimed at
improving the efficiency of inference in large language models (LLMs). RSD synergistically …

If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs

M Khalifa, YC Tan, A Ahmadian, T Hosking… - arXiv preprint arXiv …, 2024 - arxiv.org
Model merging has shown great promise at combining expert models, but the benefit of
merging is unclear when merging "generalist" models trained on many tasks. We explore …

AstroMLab 3: Achieving GPT-4o Level Performance in Astronomy with a Specialized 8B-Parameter Large Language Model

T de Haan, YS Ting, T Ghosal, TD Nguyen… - arXiv preprint arXiv …, 2024 - arxiv.org
AstroSage-Llama-3.1-8B is a domain-specialized natural-language AI assistant tailored for
research in astronomy, astrophysics, and cosmology. Trained on the complete collection of …

Modular, Collaborative and Decentralized Deep Learning

P Yadav, H Liu, W Zhao, A Douillard, M Ciccone… - ICLR 2025 Workshop … - openreview.net
The increasing complexity of modern machine learning models exposes the limitations of
the traditional, monolithic approach to their development, raising concerns about cost and …