Minimax-01: Scaling foundation models with lightning attention
A Li, B Gong, B Yang, B Shan, C Liu, C Zhu… - arxiv preprint arxiv …, 2025 - arxiv.org
We introduce MiniMax-01 series, including MiniMax-Text-01 and MiniMax-VL-01, which are
comparable to top-tier models while offering superior capabilities in processing longer …
comparable to top-tier models while offering superior capabilities in processing longer …
Scaling laws for linear complexity language models
The interest in linear complexity models for large language models is on the rise, although
their scaling capacity remains uncertain. In this study, we present the scaling laws for linear …
their scaling capacity remains uncertain. In this study, we present the scaling laws for linear …