METR’s first Frontier Risk Report
Anthropic, Google, Meta, and OpenAI let METR test internal models with chain-of-thought access and review non-public evidence about agent control risks
4 appearances on the backlist front page in the last 30 days.
Anthropic, Google, Meta, and OpenAI let METR test internal models with chain-of-thought access and review non-public evidence about agent control risks
Our report focuses on risks from AI agents intentionally causing harm within an AI company. We highlight 6 key findings that span “means” (what harmful actions agents could take), “motive” (why they might try), and “opportunity” (whether at
We created private reports for each participating company based on our model evaluations and analysis. Participants could then approve what non-public evidence we could disclose in our public report, but had no editorial control.
We surveyed 349 technical researchers, engineers, and managers (in February–April 2026) about how they use AI tools at work. On average, participants self-report that AI use made their work 1.6–2.1x more valuable, and that this multiplier