Embark Studios: Embark uses AI text-to-speech technology trained on contracted human voice actors' recordings to generate in-game voice lines, including commentator dialogue, player character barks, and contextual callouts like ping system audio. | AI Trace
Creative GenerationReplaces Human LaborVerified
Embark uses AI text-to-speech technology trained on contracted human voice actors' recordings to generate in-game voice lines, including commentator dialogue, player character barks, and contextual callouts like ping system audio.
Details
First publicly confirmed in July 2023 when audio engineers Andreas Almström and Carl Strandberg revealed on the "Meet the Makers" podcast that "all the contestant voices, like the barks and both our commentators, are AI text-to-speech" in The Finals. The TTS system is trained on recordings from professional voice actors who are paid both for booth time and for licensing their voices. Embark's stated rationale is speed — generating lines "in hours rather than months"— enabling rapid game updates. The practice sparked significant backlash from voice actors including Gianni Matragrano (Ultrakill) and Elsie Lovelock (Baldur's Gate 3). By March 2026, CEO Söderlund acknowledged "a real professional actor is better than AI; that's just how it is" and confirmed some ARC Raiders lines had been re-recorded with human actors post-launch. The studio now frames TTS as a "production tool" for testing and prototyping lines before deciding what to record with humans, and for non-essential lines like ping callouts.