35.
Fable 5 ( (x.com)
Fable 5 ( @AnthropicAI ) scores 22% and tops the Hedge-Bench leaderboard. Running Fable was roughly 2X more expensive than Opus 4.8 per trial. For an industry where accuracy is mission critical, human judgement isn't going away