28.
MiniMax M3: open-weights model with 1M context and sparse attention
MiniMax claims frontier coding and agent benchmarks while using sparse attention to scale native multimodal context to one million tokens
Editor’s note: imported_from_x_likes