41.
I'm exploring soft Muon for RL. It may encourage diversity and exploration while being more robust to noisy small singular modes of the policy gradient compared to full Muon. https://nilin.github.io/rl-diversity-soft-muon/
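
To make the "noisy small singular modes" point concrete, here is a minimal NumPy sketch. It rests on assumptions: the post does not define soft Muon, so `soft_direction` and its `tau` parameter are hypothetical stand-ins (a smooth gain `s / (s + tau * s_max)`), not the author's method. Full Muon is modeled as exact orthogonalization via SVD for clarity; Muon in practice approximates this with a Newton-Schulz iteration.

```python
import numpy as np

def full_muon_direction(grad):
    """Orthogonalize grad: every singular value is replaced by 1 (U @ V^T)."""
    u, _, vt = np.linalg.svd(grad, full_matrices=False)
    return u @ vt

def soft_direction(grad, tau=0.1):
    """Hypothetical soft variant (an assumption, not the post's definition):
    each singular value s is mapped to s / (s + tau * s_max), so strong modes
    are pushed toward 1 while very weak modes stay close to 0."""
    u, s, vt = np.linalg.svd(grad, full_matrices=False)
    if s.max() == 0.0:
        return np.zeros_like(grad)
    gain = s / (s + tau * s.max())
    return (u * gain) @ vt  # scales column i of u by gain[i]

rng = np.random.default_rng(0)
# Toy "gradient" with one strong mode, one medium mode, and one weak mode
# standing in for noise.
u, _ = np.linalg.qr(rng.standard_normal((3, 3)))
v, _ = np.linalg.qr(rng.standard_normal((3, 3)))
grad = u @ np.diag([1.0, 0.1, 1e-3]) @ v.T

full_sv = np.linalg.svd(full_muon_direction(grad), compute_uv=False)
soft_sv = np.linalg.svd(soft_direction(grad), compute_uv=False)
print("full Muon singular values:", full_sv)
print("soft variant singular values:", soft_sv)
```

Under these assumptions, full orthogonalization lifts the weak mode to the same magnitude as the strong one, while the soft map leaves it heavily damped. Whether the author's soft Muon behaves this way depends on their actual definition at the linked page.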