MiniMax M3 open weights: 428B parameters, 1M context (t.co)
MiniMax released a 428B-parameter open-weight model with 23B active parameters and MiniMax Sparse Attention for million-token contexts
5 appearances on the backlist front page in the last 30 days.
MiniMax released a 428B-parameter open-weight model with 23B active parameters and MiniMax Sparse Attention for million-token contexts
MiniMax M3, Open-Weight, Now On Hugging Face Weights: https:// huggingface.co/MiniMaxAI/Mini Max-M3 … MiniMax Sparse Attention: https:// huggingface.co/papers/2606.13 392 …
MiniMax claims frontier coding and agent benchmarks while using sparse attention to scale native multimodal context to one million tokens
Editor’s note: imported_from_x_likes
M3 on @OpenRouter same day we dropped it . 1M context, frontier coding + agentic, native multimodal. 50% off the first week.
M3 on @AskVenice , available anonymously open-weight, frontier coding + agentic, 1M context, native multimodal. Live on day one