68.
Everyone's racing to build "AI scientists." So we asked a blunt question:
Everyone's racing to build "AI scientists." So we asked a blunt question: Can today's best coding agents beat the published SOTA of real Nature papers — on their own, no web search, with the original method hidden? Introducing NatureBench