Meta: Meta is training large language models to read its content policies and make enforcement decisions on flagged posts — replacing human content reviewers in an increasing number of violation categories. | AI Trace
Content ModerationReplaces Human LaborVerified
Meta is training large language models to read its content policies and make enforcement decisions on flagged posts — replacing human content reviewers in an increasing number of violation categories.
Details
Rolled out Q1 2025. LLMs trained on Meta's Community Standards make violation determinations. Early internal tests show LLM performance exceeding human reviewer accuracy in select policy areas. NPR reporting (May 2025) revealed Meta plans to replace human 'privacy and societal risk' assessors with AI. Meta's Oversight Board has commented that this shift requires robust transparency and appeals mechanisms.