You're looking at a specific version of this model. Jump to the model overview.

cjwbw /sadtalker:a519cc0c

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
source_image
string
Upload the source image, it can be video.mp4 or picture.png
driven_audio
string
Upload the driven audio, accepts .wav and .mp4 file
use_enhancer
boolean
False
Use GFPGAN as Face enhancer
pose_style
integer
0

Max: 45

Pose style
expression_scale
number
1
a larger value will make the expression motion stronger
use_eyeblink
boolean
True
Use eye blink
preprocess
None
crop
Choose how to preprocess the images
size_of_image
None
256
Face model resolution
facerender
None
facevid2vid
Choose face render
still_mode
boolean
True
Still Mode (fewer head motion, works with preprocess 'full')

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}