58.
Nonsense helps LLMs reason better
Nonsense helps LLMs reason better LoPE prepends Lorem Ipsum to prompts when GRPO hits the zero-advantage problem, unlocking orthogonal reasoning paths and boosting math scores across 1.7B-7B models.
2 appearances on the backlist front page in the last 30 days.
Nonsense helps LLMs reason better LoPE prepends Lorem Ipsum to prompts when GRPO hits the zero-advantage problem, unlocking orthogonal reasoning paths and boosting math scores across 1.7B-7B models.