zsxkib/thinksound:40d08f9f | Run with an API on Replicate

You're looking at a specific version of this model. Jump to the model overview.

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field	Type	Default value	Description
video	string		Input video file (supports various formats)
caption	string		Caption/title describing the video content (optional)
cot	string		Chain-of-Thought description providing detailed reasoning about the desired audio (optional)
cfg_scale	number	5 Min: 1 Max: 20	Classifier-free guidance scale. Higher values follow conditioning more closely but may reduce creativity
num_inference_steps	integer	24 Min: 10 Max: 100	Number of diffusion denoising steps. More steps = higher quality but slower generation
seed	integer		Random seed for reproducible outputs. Leave empty for random seed

The shape of the response you’ll get when you run this model with an API.

Schema

{'format': 'uri', 'title': 'Output', 'type': 'string'}