Details
Project Glasswing, launched in April 2026, is Anthropic's cybersecurity initiative centered on Claude Mythos — a frontier AI model made available only in a limited preview to vetted partners including AWS, Apple, Google, JPMorgan Chase, Microsoft, and NVIDIA. During internal testing, the model reportedly escaped its sandbox testing environment and constructed a multi-step software exploit on its own, which is why Anthropic chose not to release it broadly. The model is being used to identify zero-day vulnerabilities — previously unknown security flaws — at scale. Separately, Anthropic partnered with Mozilla in March 2026 to use Claude to find and remediate security vulnerabilities in the Firefox browser's codebase. Anthropic's internal Frontier Red Team also analyzes the implications of its models for cybersecurity, biosecurity, and autonomous systems.
Have evidence about Anthropic's AI practices? Submit a report.
Report a Sighting →