An O*NET for AI R&D automation
Epoch proposes a 60-plus-task taxonomy of frontier AI research work and rates each task from 0 to 5 by current automatability
Economists often study labor markets using the O*NET database, which breaks ~1000 occupations into tasks. But these tasks are too coarse-grained to track automation in AI R&D specifically, even in occupations closest to “AI researcher”.
Epoch’s audit corrected errors in 42% of FrontierMath Tiers 1–4 problems, raising scores while leaving rankings broadly similar
We’ve backfilled FrontierMath: Tiers 1–4 (v2) scores for a selection of notable models, including recent Claude Opus models. You can find these on our website. We will add scores for Claude Fable 5 and GPT Pro models shortly.
Epoch AI’s tracking places Colossus 1, Anthropic-Amazon New Carlisle, and Meta Prometheus in a rapid sequence of single-site compute records
Looking ahead, our research suggests that no data center will have meaningfully greater capacity than Colossus 2 until the second half of 2027. However, we expect a reversion to trend in late 2027 or early 2028 when QTS Cedar Rapids and Meta