Lipsync models

Updated April 23, 2026See Lipsync app →

Workroom offers several lipsync models to sync a voiceover to a face. The default, Lipsync Labs, takes a video with a talking head. InfiniteTalk is different — it takes a still image and generates multi-person conversational video from it. Lipsync 2 Pro works like Lipsync Labs but produces higher output quality, up to 4K.

Lipsync main interface with the model selector in the top-left corner
Lipsync main interface with the model selector in the top-left corner

Model overview

ModelInputBest for
Lipsync LabsVideo + voiceoverGeneral use, fast turnaround
InfiniteTalkImage + voiceoverMulti-person conversational video generation
Lipsync 2 ProVideo + voiceoverHigher output quality, up to 4K
On a Pro plan, generations with Lipsync Labs are unlimited.

InfiniteTalk — image input

InfiniteTalk is the only model that accepts a static image instead of a video. It generates multi-person conversational video from a still photo — useful when you have a reference image but no recorded footage.

When you select InfiniteTalk, the Video button in the control bar changes to Image.

Switch your model

Open the model selector in the top-left corner of the Lipsync page. Selecting a model immediately updates the input controls in the bottom bar.

Start with Lipsync Labs for most projects. Switch to InfiniteTalk when you only have a photo. Use Lipsync 2 Pro when output resolution matters.
Once you've picked a model, see Prepare source files for input guidelines and Create a lip-sync video to generate your first result.