36. "Sparser, Faster, Lighter Transformer Language Models" by @askalphaxiv (alphaXiv) · backlist 2026-05-09 · rubric 91.0
36. "DGPO: Distribution Guided Policy Optimization for Fine Grained Credit Assignment" by @askalphaxiv (alphaXiv) · backlist 2026-05-07 · rubric 95.0
62. With the incredible depth of the DeepSeek-V4 Technical Report, we put together a list of must-read papers behind … by @askalphaxiv (alphaXiv) · backlist 2026-05-06 · rubric 86.0