UniMoD: Efficient Unified Multimodal Transformers with Mixture-of-Depths

W Mao, Z Yang, MZ Shou - arXiv preprint arXiv:2502.06474, 2025 - arxiv.org
Unified multimodal transformers, which handle both generation and understanding tasks
within a shared parameter space, have received increasing attention in recent research …