Model merging in llms, mllms, and beyond: Methods, theories, applications and opportunities
Model merging is an efficient empowerment technique in the machine learning community
that does not require the collection of raw training data and does not require expensive …
that does not require the collection of raw training data and does not require expensive …
What Matters for Model Merging at Scale?
Model merging aims to combine multiple expert models into a more capable single model,
offering benefits such as reduced storage and serving costs, improved generalization, and …
offering benefits such as reduced storage and serving costs, improved generalization, and …
From lists to emojis: How format bias affects model alignment
In this paper, we study format biases in reinforcement learning from human feedback
(RLHF). We observe that many widely-used preference models, including human …
(RLHF). We observe that many widely-used preference models, including human …
How to Merge Your Multimodal Models Over Time?
Model merging combines multiple expert models-finetuned from a base foundation model
on diverse tasks and domains-into a single, more capable model. However, most existing …
on diverse tasks and domains-into a single, more capable model. However, most existing …
Parameter-Efficient Interventions for Enhanced Model Merging
Model merging combines knowledge from task-specific models into a unified multi-task
model to avoid joint training on all task data. However, current methods face challenges due …
model to avoid joint training on all task data. However, current methods face challenges due …
If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs
Model merging has shown great promise at combining expert models, but the benefit of
merging is unclear when merging``generalist''models trained on many tasks. We explore …
merging is unclear when merging``generalist''models trained on many tasks. We explore …
Domain Adaptation for Robust Model Routing
The rapid proliferation of domain-specialized machine learning models presents a
challenge: while individual models excel in specific domains, their performance varies …
challenge: while individual models excel in specific domains, their performance varies …
LOCMAP: LOW-COMPUTE MODEL MERGING WITH AMORTIZED PARETO FRONTS VIA QUADRATIC APPROXIMATION
AP FRONTS - openreview.net
Model merging has emerged as an effective approach to combine multiple single-task
models into a multitask model. This process typically involves computing a weighted …
models into a multitask model. This process typically involves computing a weighted …