48.
prime-rl can now train 1T parameters MoE blazingly fast, under 5 minutes per step, or 1k steps in ~3 days
prime-rl can now train 1T parameters MoE blazingly fast, under 5 minutes per step, or 1k steps in ~3 days To achieve this we shipped in our latest prime-rl 0.6.0: * inference: wide-ep, fp8 inference, llm-d router, mooncake, kv cache cpu