
lucataco/higgs-audio-v2:f5945a34

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.

| Field | Type | Default value | Description |
| --- | --- | --- | --- |
| text | string | The sun rises in the east and sets in the west | Text to convert to speech |
| temperature | number | 0.3 (min 0.1, max 1) | Controls randomness in generation. Lower values are more deterministic. |
| top_p | number | 0.95 (min 0.1, max 1) | Nucleus sampling parameter. Controls diversity of generated audio. |
| top_k | integer | 50 (min 1, max 100) | Top-k sampling parameter. Limits vocabulary to top k tokens. |
| max_new_tokens | integer | 1024 (min 256, max 2048) | Maximum number of audio tokens to generate |
| scene_description | string | Audio is recorded from a quiet room. | Scene description for audio context |
| system_message | string | | Custom system message (optional) |
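
As a minimal sketch of how the fields above fit together, the following Python builds an input object from the documented defaults and checks numeric values against the documented minimums and maximums. The helper name `build_input` and the constants `DEFAULTS` and `RANGES` are hypothetical and introduced only for illustration; the API request itself is left out because this page does not describe the endpoint or client.

```python
# Documented (min, max) bounds for the numeric fields in the input schema.
RANGES = {
    "temperature": (0.1, 1),
    "top_p": (0.1, 1),
    "top_k": (1, 100),
    "max_new_tokens": (256, 2048),
}

# Documented defaults. system_message has no listed default, so it is omitted.
DEFAULTS = {
    "text": "The sun rises in the east and sets in the west",
    "temperature": 0.3,
    "top_p": 0.95,
    "top_k": 50,
    "max_new_tokens": 1024,
    "scene_description": "Audio is recorded from a quiet room.",
}


def build_input(**overrides):
    """Hypothetical helper: merge overrides into the defaults and range-check them."""
    unknown = set(overrides) - set(DEFAULTS) - {"system_message"}
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    payload = {**DEFAULTS, **overrides}
    for name, (low, high) in RANGES.items():
        if not low <= payload[name] <= high:
            raise ValueError(f"{name}={payload[name]} is outside [{low}, {high}]")
    return payload


# Override two fields and keep the documented defaults for the rest.
payload = build_input(text="Hello from Higgs Audio.", temperature=0.5)
print(payload)
```

The resulting dictionary would be sent as the input object of the API request. Checking ranges locally is a design choice for faster feedback; the service presumably enforces the same limits itself.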

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}
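
The schema says the response is a single string in URI format. A small sketch of checking a response value against that shape, using only the Python standard library, might look like this. The function name `is_uri_string` and the example URL are hypothetical; the page does not show a real output value or state the audio file type.

```python
from urllib.parse import urlparse


def is_uri_string(output):
    """Return True if output is a string with a URI scheme and a host or path."""
    if not isinstance(output, str):
        return False
    parsed = urlparse(output)
    return bool(parsed.scheme) and bool(parsed.netloc or parsed.path)


# Hypothetical response value; the real URL shape is not given in the source.
example_output = "https://example.com/output.wav"
print(is_uri_string(example_output))
```

This is a loose structural check, not a full validation of the URI format keyword.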