Details
Roblox's moderation infrastructure uses large transformer-based machine learning models, optimized through distillation and quantization, to classify text, images, voice, and 3D assets for policy violations in milliseconds. In March 2026, Roblox launched a real-time multimodal moderation system that evaluates entire in-game scenes (avatars, text, and 3D objects together) to catch content that passes individual item checks but is problematic in combination (e.g., an offensive drawing paired with a particular avatar). As of early 2026, the system shuts down approximately 5,000 servers per day for violations, and Roblox is targeting monitoring of 100% of playtime. Roblox states that AI is deployed for moderation only when it outperforms humans in both precision and recall at scale, with human moderators retained for nuanced cases, complex investigations, and appeals.
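To make the optimization step concrete: post-training quantization, one of the two techniques named above, maps float32 weights to low-precision integers so a classifier can run in milliseconds at lower memory cost. The sketch below shows symmetric per-tensor int8 quantization in pure Python. It is a generic illustration of the technique, not Roblox's actual pipeline, and all weight values are made up for the example.

```python
# Minimal sketch of symmetric per-tensor int8 post-training quantization.
# Illustrative only -- the weights and scale here are invented, and a real
# deployment would quantize whole tensors with a library, not Python lists.

def quantize_int8(weights):
    """Map float weights to int8 using a single symmetric scale factor."""
    scale = max(abs(w) for w in weights) / 127.0  # largest value maps to 127
    q = [max(-128, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from the int8 values."""
    return [qi * scale for qi in q]

weights = [0.12, -0.53, 0.91, -0.07]
q, scale = quantize_int8(weights)
approx = dequantize(q, scale)

# Each int8 weight occupies 1 byte instead of 4 for float32 (~4x smaller),
# and integer matrix multiplies are faster on most hardware -- the tradeoff
# is a small rounding error, bounded by half the scale factor per weight.
max_err = max(abs(a - w) for a, w in zip(approx, weights))
```

Distillation is complementary: a smaller "student" model is trained to match a large model's outputs, shrinking the network itself rather than its numeric precision, and the two are commonly combined for low-latency inference.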