@HuggingPapers on Backlist

8 appearances on the backlist front page in the last 30 days.

34.

LongTraceRL teaches LLMs to reason through 128K-token contexts by learning from search-agent trajectories and fine-grained, entity-level rubric rewards.

by (DailyPapers) · backlist 2026-06-01 · rubric 93.0