Stability AI | AI Trace
Content Moderation (Verified)
Stability AI uses AI-powered content filters on its platforms and API to detect and block policy-violating content, including prompts that may produce, and images that may contain, NSFW material, child sexual abuse material (CSAM), or other prohibited content. The system runs automatically on both user inputs and generated outputs.
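The general pattern described here, gating both the input prompt and the generated output, can be sketched as follows. This is an illustrative assumption, not Stability AI's actual implementation: the function names, the blocked-term set, and the 0.5 score threshold are all hypothetical stand-ins.

```python
from __future__ import annotations

# Hypothetical rule set standing in for a real text prompt filter.
BLOCKED_TERMS = {"nsfw_term"}

def prompt_allowed(prompt: str) -> bool:
    """Input-side check: reject prompts containing any blocked term."""
    return set(prompt.lower().split()).isdisjoint(BLOCKED_TERMS)

def image_allowed(nsfw_score: float, threshold: float = 0.5) -> bool:
    """Output-side check: reject images an NSFW classifier scores at or above the threshold."""
    return nsfw_score < threshold

def moderated_generate(prompt: str, generate, classify) -> str | None:
    """Run generation only when both the input and the output pass moderation."""
    if not prompt_allowed(prompt):
        return None              # blocked at the input stage
    image = generate(prompt)
    if not image_allowed(classify(image)):
        return None              # blocked at the output stage
    return image
```

Checking both sides is what lets such a system catch benign-looking prompts that nonetheless yield violating outputs, as well as overtly violating prompts before any compute is spent on generation.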
Details
According to Stability AI's published integrity transparency report, the company operates multiple layers of content moderation: in-house text prompt filters that block generation requests violating the Acceptable Use Policy; in-house NSFW image classifiers that flag uploaded images and video; CSAM hashing systems using industry hash lists from Thorn's Safer and the Internet Watch Foundation (IWF); and a combination of automated and human review by an internal Integrity team. The company also uses in-house NSFW classifiers and open-source classifiers to filter training data. Confirmed CSAM is reported to NCMEC.
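The hash-matching layer mentioned above can be illustrated with a minimal sketch. Production systems such as Thorn's Safer and the IWF hash list use proprietary perceptual hashes that tolerate re-encoding and resizing; this stand-in uses SHA-256 exact matching, which does not, and the hash list contents here are invented for illustration.

```python
import hashlib

# Assumed hash list; real lists are distributed under agreement by
# organizations such as Thorn and the IWF, and are not public.
KNOWN_HASHES = {hashlib.sha256(b"known-bad-bytes").hexdigest()}

def matches_hash_list(image_bytes: bytes) -> bool:
    """Return True when the image's digest appears on the hash list."""
    return hashlib.sha256(image_bytes).hexdigest() in KNOWN_HASHES
```

Because matching is done against digests rather than stored images, the platform never needs to hold copies of the prohibited material itself to detect re-uploads of it.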
Products affected
Stability AI Developer Platform API, Stable Assistant, Stable Diffusion (hosted versions)