@zhuoran_yang on Backlist

1 appearance on the backlist front page in the last 30 days.

In our prior work (http://arxiv.org/pdf/2509.26030), we showed that Muon outperforms Adam on heavy-tailed knowledge tasks. In this work, we examine Muon's superiority from the perspective of loss curvature. The main take-home message is

by (Zhuoran Yang) · backlist 2026-06-04 · rubric 74.0