In our prior work (http://arxiv.org/pdf/2509.26030) we showed that Muon outperforms Adam on heavy-tailed knowledge tasks. In this work, we examine Muon's superiority from the perspective of loss curvature. The main take-home message is