What is Video Translation?

Translating a spoken-video script into another language and re-rendering the speaker so their lips match the new audio. Differs from subtitles (which just overlay text) — translated video keeps the speaker on camera and feels native to the target audience.

Translate a video into any language →

← AI Dubbing AI Spokesperson →

Related Terms

AI Dubbing

Automatically replacing the original audio in a video with a synthesized translation, while keeping mouth movement convincingly aligned. Puppetry supports AI dubbing across 65+ languages — paste a script in a new language and the lip sync re-renders for that audio.

Lip Sync / Lip Syncing

The process of matching mouth movements to audio speech. In AI video, lip sync algorithms analyze audio waveforms and generate realistic mouth shapes frame-by-frame. Puppetry uses LivePortrait + Wav2Lip for production-quality lip sync across 65+ languages.

Text-to-Speech (TTS)

Technology that converts written text into spoken audio. Modern TTS systems produce natural-sounding voices with emotion, pacing, and accent control. Puppetry offers 500+ AI voices across 65+ languages.

Neural Voice

A synthetic voice generated by deep neural networks (as opposed to older concatenative TTS). Neural voices sound significantly more natural, with proper intonation, breathing, and emotional range. Leading providers produce voices.

← Back to full glossary