You're looking at a specific version of this model. Jump to the model overview.
subhash25rawat /tts:5f227cf6
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
Field | Type | Default value | Description |
---|---|---|---|
ref_audio_input |
string
|
the audio to be copied
|
|
ref_text_input |
string
|
|
the word to word transcription of the reference audio
|
gen_text_input |
string
|
the text for which audio need to be generated
|
|
remove_silence |
boolean
|
False
|
remove silences from the generated audio
|
cross_fade_duration |
number
|
1
Min: 0.3 Max: 2 |
None
|
nfe_steps |
integer
|
32
Min: 4 Max: 64 |
Number of denoising steps
|
speed |
number
|
1
Min: 0.3 Max: 2 |
The speed up factor of the generated audio
|
model |
string
(enum)
|
E2-TTS
Options: E2-TTS, F5-TTS, Hindi-TTS |
An enumeration.
|
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}