@LinasNasvytis on Backlist

70.

1/ New preprint! Reasoning models often require hundreds of task examples and thousands of rollouts to improve on…

1/ New preprint! Reasoning models often require hundreds of task examples and thousands of rollouts to improve on a task. How can they learn more from much less? Introducing CORE: contrastive self-reflection for rapid, sample-efficient, an

by @LinasNasvytis (Linas Nasvytis) · backlist 2026-06-08 · rubric 72.0