DLO: Dynamic Layer Operation for Efficient Vertical Scaling of LLMs

Z Tan, D Dong, X Zhao, J Peng, Y Cheng… - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we introduce Dynamic Layer Operations (DLO), a novel approach for vertically
scaling transformer-based Large Language Models (LLMs) by dynamically expanding …