You're looking at a specific version of this model. Jump to the model overview.
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
Field | Type | Default value | Description |
---|---|---|---|
prompt |
string
|
|
Text prompt for generated audio
|
negative_prompt |
string
|
music
|
Negative prompt to avoid certain sounds
|
video |
string
|
Optional video file for video-to-audio generation
|
|
duration |
number
|
8
Min: 1 |
Duration of output in seconds
|
num_steps |
integer
|
25
|
Number of inference steps
|
cfg_strength |
number
|
4.5
Min: 1 |
Guidance strength (CFG)
|
seed |
integer
|
Min: -1 |
Random seed. Use -1 or leave blank to randomize the seed
|
image |
string
|
Optional image file for image-to-audio generation (experimental)
|
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}