You're looking at a specific version of this model. Jump to the model overview.

zsxkib /mmaudio:62871fb5

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
prompt
string
Text prompt for generated audio
negative_prompt
string
music
Negative prompt to avoid certain sounds
video
string
Optional video file for video-to-audio generation
duration
number
8

Min: 1

Duration of output in seconds
num_steps
integer
25
Number of inference steps
cfg_strength
number
4.5

Min: 1

Guidance strength (CFG)
seed
integer

Min: -1

Random seed. Use -1 or leave blank to randomize the seed
image
string
Optional image file for image-to-audio generation (experimental)

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}