SandVox

Game Audio Localization — Dubbing, Subtitles, and Voice Recording

Game Localization · All Services

Game Audio Localization — Dubbing, Subtitles, and Voice Recording

Native translators. Translation Memory. In-build LocQA. Get a free quote →

Audio localization for games encompasses subtitles, partial dubbing, and full voice acting replacement — three distinct approaches with very different costs, timelines, and player expectations. Most game localization is text-only (subtitles + UI text) because full dubbing is expensive and time-intensive. For games with significant dialogue, the audio localization decision affects budget, player experience, and market expectations. This guide explains the options, costs, and considerations for game audio localization.

Sub-Only vs. Full Dub — The Core Decision

The core audio localization decision is whether to dub (replace voice acting with recorded performances in the target language) or subtitle (provide on-screen text while keeping the original audio). Sub-only localization is: significantly cheaper (no voice recording costs), faster (no casting, recording, implementation), and often the expectation for certain game genres (JRPGs, anime games) and markets (Japanese games in Western markets). Full dubbing is: expensive (casting, recording, post-production), time-intensive (adds weeks to timelines), and the expectation for certain genres (AAA action games, narrative games targeting Western markets) and some markets (Germany has strong dubbing expectations for major releases). Most indie and mid-size studios choose sub-only localization for all markets, with full or partial dubbing reserved for the highest-profile releases.

The Subtitle Pipeline

Subtitles are the text displayed on screen when characters speak. Subtitle localization in games involves: translating all voiced lines from the original script, timing subtitle display to match audio playback, line breaking subtitle text for readability (subtitles have specific character-per-line constraints), and implementing subtitle files in the game engine (typically SRT, VTT, or engine-native subtitle formats). Game subtitle translation differs from text string translation in several ways: subtitle lines have timing constraints tied to audio, subtitle display speed must be readable for players, and subtitle context depends on audio — translators must be able to listen to (or review transcripts of) the original voice performance to capture tone and intent.

Voice Direction and Localization Dubbing

Full game dubbing involves: (1) Script adaptation — rewriting dialogue for lip-sync (matching mouth movements to new audio), natural speaking rhythm, and target-language idiom. (2) Casting — finding voice actors in the target language whose voices and delivery match the character. (3) Direction — voice direction (typically remote or in-person) to ensure performances match the game’s tone and character profiles. (4) Recording — studio recording of all voiced lines. (5) Post-production — audio editing, noise reduction, and level matching. (6) Implementation — integrating localized audio files into the game engine’s audio system. Lip-sync dubbing (for cutscenes with visible speaking characters) requires additional script adaptation to match mouth movements — significantly increasing complexity and cost compared to off-screen dubbing.

Partial Dubbing — Menus and Critical Lines

Between sub-only and full dub lies partial dubbing — recording a subset of voiced content. Common partial dubbing approaches: (1) UI audio dubbing — localizing voiced menu elements, system prompts, and tutorial narration while leaving gameplay dialogue in the original language. (2) Key character lines — dubbing protagonist lines and major character interactions while leaving minor NPC dialogue subtitled. (3) Cinematics-only — dubbing cutscenes while leaving gameplay dialogue audio unchanged. Partial dubbing allows publishers to deliver some localized audio content at a fraction of full dub cost. It is most effective when the audio experience of the game is relatively concentrated — if 80% of player audio exposure is in cutscenes, cinematics dubbing captures most of the perceived audio localization value.

Frequently Asked Questions

What languages typically expect dubbed game audio?

Dubbing expectations vary by market and genre. Germany, France, Spain, and Brazil have traditionally been the ‘FIGS’ markets where dubbing expectations are highest — major releases typically include German, French, Spanish, and Brazilian Portuguese dubbing. Japan-developed games shipping to Japan are always fully voiced in Japanese; Western games shipping to Japan frequently dub into Japanese for major releases. Most other markets (Eastern Europe, Southeast Asia, smaller European markets) do not have strong dubbing expectations — sub-only localization is standard. Japanese game exports to Western markets are typically sub-only, as JRPG audiences prefer original Japanese audio.

How much does dubbing add to localization cost?

Voice dubbing costs depend on word count, language count, and dubbing type. As a rough order of magnitude: a small game with 5,000 words of voiced dialogue might cost $5,000–15,000 per language to dub (casting, recording, direction, post-production). A mid-size game with 25,000 words of voiced dialogue might cost $25,000–75,000 per language. Lip-sync dubbing for cinematic-heavy games adds a significant premium — 1.5–2x the cost of off-screen dubbing for the script adaptation and recording overhead. For comparison, text-only localization of the same word count in the same language might cost $1,500–5,000. Dubbing is typically 5–20x more expensive per language than text-only localization.

Start Your Game Audio Localization Project

Tell us your word count, target languages, and platform. We return translated files ready for import — with Translation Memory and terminology glossary included. Free quote in one business day.