@askalphaxiv on Backlist

5 appearances on the backlist front page in the last 30 days.

44.

MiniMax-M2 paper just dropped. The key focus of M2 is on something more agent-native. It trains on runnable workspaces and artifact-grounded rewards, then uses Forge to scale RL over long coding, app, search, and office-task trajectories.

by (alphaXiv) · backlist 2026-05-27 · rubric 92.0