Anthropic’s Nicholas Carlini shifts from AI safety warning to model-release advocate

Published June 17, 2026

Score

The Trump administration has raised concerns about whether Anthropic's frontier AI systems pose cybersecurity risks, a worry shared among roughly 700 cybersecurity experts who discussed the issue in March. The specific terms of Anthropic's access controls for Mythos and the scope of the restricted preview remain unclear.

For practitioners, this reflects an unresolved tension in AI governance: whether models capable of advanced vulnerability discovery should be withheld entirely, tightly restricted to trusted researchers, or released under controlled conditions. As U.S. officials scrutinize the cybersecurity implications of frontier AI systems, Anthropic's approach to Mythos will likely become a test case for how companies balance capability demonstration, safety assurance, and regulatory pressure. Attorneys advising AI companies or monitoring federal AI policy should track both the access framework Anthropic ultimately adopts and any formal guidance the administration issues on cybersecurity-adjacent AI capabilities.

Anthropic’s Nicholas Carlini shifts from AI safety warning to model-release advocate

Why it matters

Sources

mail Subscribe to Law And Technology email updates

Related

U.S. AI governance is shifting to real-time controls as policy lags

EU institutions strike deal on Digital Omnibus delaying key AI Act deadlines

OpenClaw founders warn AI-generated “vibe slop” is creating risky code