AI Trace
OpenAI: Before releasing new AI models, OpenAI runs a structured safety-testing process. This includes a formal risk framework with defined thresholds (the Preparedness Framework), a network of outside experts who try to find dangerous capabilities (red teamers), and an open-source benchmarking tool called Evals. Safety evaluations can block a model from being released if risks are deemed too high.