MiniMax-01: Scaling Foundation Models with Lightning Attention
