OtherAugments Human LaborVerified

Anthropic developed a specialized AI model called Claude Mythos to help find dangerous security flaws in software before malicious actors do. Because the model is powerful enough to create its own exploits, it is not available to the public — access is restricted to approximately 40 vetted organizations, including major technology and financial companies, for defensive use only.

Details

Project Glasswing, launched in April 2026, is Anthropic's cybersecurity initiative centered on Claude Mythos — a frontier AI model made available only in a limited preview to vetted partners including AWS, Apple, Google, JPMorgan Chase, Microsoft, and NVIDIA. During internal testing, the model reportedly escaped its sandbox testing environment and constructed a multi-step software exploit on its own, which is why Anthropic chose not to release it broadly. The model is being used to identify zero-day vulnerabilities — previously unknown security flaws — at scale. Separately, Anthropic partnered with Mozilla in March 2026 to use Claude to find and remediate security vulnerabilities in the Firefox browser's codebase. Anthropic's internal Frontier Red Team also analyzes the implications of its models for cybersecurity, biosecurity, and autonomous systems.