6.
NVIDIA Nemotron 3 Ultra technical report (t.co)
NVIDIA released a hybrid Mamba-attention MoE model (550B total parameters, 55B active) with an open post-training stack aimed at agentic workloads.
1 appearance on the backlist front page in the last 30 days.