Spawning AI | AI Trace
Productivity Automation | Verified
Spawning AI offers the Data Diligence package, an API and open-source Python library that automates the checking of media URLs against opt-out registries so AI developers can filter out creator-restricted content before training their models.
Details
The Data Diligence package lets AI model trainers automatically check each media item they intend to download against Spawning's Do Not Train Registry (DNTR) and other machine-readable rights reservations, such as HTTP headers and ai.txt. Items registered in the DNTR are not downloaded, and items carrying other rights-reservation signals are excluded from the training dataset. Spawning describes the package as supporting compliance with the EU's text and data mining copyright exceptions and the upcoming AI Act. Because the package aggregates multiple opt-out standards, developers do not need to manage each registry separately.
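The two-stage filtering described above can be illustrated with a minimal Python sketch. This is not Spawning's actual API: the function names, the in-memory `registry` set, and the injected `fetch_headers` callable are all hypothetical, and the choice of header signals (`X-Robots-Tag` values `noai` and `noimageai`, plus a `tdm-reservation` header set to `1`) is an assumption made for illustration only.

```python
from typing import Callable, Iterable, List, Mapping, Set, Tuple

# Assumption: directive values treated as opt-outs in an X-Robots-Tag header.
OPT_OUT_DIRECTIVES = {"noai", "noimageai"}


def header_opts_out(headers: Mapping[str, str]) -> bool:
    """Return True if the response headers carry a rights reservation signal."""
    # HTTP header names are case-insensitive, so normalize them first.
    normalized = {name.lower(): value for name, value in headers.items()}

    robots = normalized.get("x-robots-tag", "")
    directives = {part.strip().lower() for part in robots.split(",")}
    if directives & OPT_OUT_DIRECTIVES:
        return True

    # Assumption: a "tdm-reservation" header with value "1" reserves TDM rights.
    return normalized.get("tdm-reservation", "").strip() == "1"


def filter_for_training(
    urls: Iterable[str],
    registry: Set[str],
    fetch_headers: Callable[[str], Mapping[str, str]],
) -> Tuple[List[str], List[str], List[str]]:
    """Split URLs into (kept, not_downloaded, excluded).

    registry is a hypothetical stand-in for a Do Not Train Registry lookup.
    fetch_headers is a hypothetical callable returning response headers.
    """
    kept: List[str] = []
    not_downloaded: List[str] = []
    excluded: List[str] = []
    for url in urls:
        if url in registry:
            # Registered items are never fetched at all.
            not_downloaded.append(url)
            continue
        if header_opts_out(fetch_headers(url)):
            # Fetched, but dropped from the training set.
            excluded.append(url)
        else:
            kept.append(url)
    return kept, not_downloaded, excluded
```

Passing `fetch_headers` in as a parameter keeps the sketch free of network calls, so it can be exercised with canned header dictionaries. A real pipeline would replace the set lookup with a query to the registry service and would batch those queries.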