@AdtRaghunathan on Backlist

45.

We've always intuited that verification is easier than generation. Chen's new work shows that explicitly training…

We've always intuited that verification is easier than generation. Chen's new work shows that explicitly training for it unlocks massive self-improvement: 14× boost in test-time refinement on hard reasoning 30% gain beyond the RL plateau

by @AdtRaghunathan (Aditi Raghunathan) · backlist 2026-06-08 · rubric 79.0