Lipsync troubleshooting

Updated April 22, 2026See Lipsync app →

Most lip-sync issues come from source file quality or model mismatch. This guide covers the most common problems and how to resolve them.

Generate button stays grey

The Generate button only activates when both a source file and a voiceover are selected. If it stays grey, at least one input is empty.

Click Video (or Image if using InfiniteTalk) and select a file, then click Voiceover and select an audio track. See Create a lip-sync video for the step-by-step flow.

Lips don't sync accurately

Sync errors usually come from voiceover pacing. Long pauses, very fast speech, or background noise in the audio make lip tracking harder. Trim silence from the start and end of your voiceover before uploading. You can do this in Voice Studio or any audio editor.

Source video quality also matters — shaky footage or a face that moves frequently out of center reduces accuracy. A steady video with a face in the center works best.

Face is not detected

If generation produces no visible lip movement, the model may not have found a usable face in the source. Common causes:

  • Multiple faces in the frame
  • Face too small or partially out of frame
  • Low contrast or very dark footage
Use a clip with one clearly visible face, close to the center, with good lighting. See Prepare source files for more input guidance.

Output quality is lower than expected

The default model, Lipsync Labs, is optimized for speed. For better output resolution — up to 4K — switch to Lipsync 2 Pro. If you're starting from a still photo, InfiniteTalk is the only model that accepts image input.

See Lipsync models for a full comparison.

Processing takes longer than expected

Processing time scales with voiceover length. A 30-second voiceover generates faster than a 5-minute one. This is expected behavior — longer audio requires more processing. There's no way to speed it up within the app.