3. ProgramBench: rebuild a program from its binary and docs; frontier models solved zero tasks by @boyuan_chen (Boyuan (Nemo) Chen) · backlist 2026-05-07 · rubric 88.0