Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent

Y Wei, A Tang, L Shen, F Xiong, C Yuan… - arXiv preprint arXiv …, 2025 - arxiv.org
Merging multiple expert models offers a promising approach for performing multi-task
learning without accessing their original data. Existing methods attempt to alleviate task …