AI Trace
Anthropic: Anthropic operates a formal safety framework called the Responsible Scaling Policy, which sets rules for when and how it can train and release more powerful AI models. Under this policy, each new Claude model is assigned a safety level and must pass specific safety tests before it can be deployed. The framework is now in its third version and has been updated as Claude's capabilities have grown.