jaaari/zonos:c9eef8e2

Input

string (required)

Text to generate speech from

file

Path to audio file for voice cloning (optional)

string

Language code for speech generation

Default: "en-us"

string

Model type to use

Default: "transformer"

string

Optional comma-separated list of 8 floats giving the desired emotion vector, in the order [Happiness, Sadness, Disgust, Fear, Surprise, Anger, Other, Neutral]. For example: '0.5,0.2,0.0,0.0,0.3,0.1,0.0,0.0'. If the value is empty or invalid, the model falls back to its built-in, roughly neutral emotion.

Default: ""
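The emotion string is easy to get wrong by hand, so here is a minimal sketch of building and checking it in Python. The helper names (format_emotion_vector, parse_emotion_vector) are hypothetical and not part of the model; only the 8-value order and the comma-separated format come from the description above. How the model itself decides a value is "invalid" is not documented here, so the check below is an assumption.

```python
from typing import Dict, List, Optional

# Order taken from the input description above.
EMOTION_ORDER = [
    "Happiness", "Sadness", "Disgust", "Fear",
    "Surprise", "Anger", "Other", "Neutral",
]


def format_emotion_vector(weights: Dict[str, float]) -> str:
    """Build the comma-separated string from named weights (hypothetical helper).

    Emotions that are not mentioned default to 0.0.
    """
    unknown = set(weights) - set(EMOTION_ORDER)
    if unknown:
        raise ValueError(f"unknown emotion names: {sorted(unknown)}")
    return ",".join(str(float(weights.get(name, 0.0))) for name in EMOTION_ORDER)


def parse_emotion_vector(text: str) -> Optional[List[float]]:
    """Return the 8 floats, or None if the string is empty or malformed.

    Assumption: "invalid" means "not exactly 8 parseable floats".
    """
    parts = text.split(",")
    if len(parts) != len(EMOTION_ORDER):
        return None
    try:
        return [float(part) for part in parts]
    except ValueError:
        return None


if __name__ == "__main__":
    vector = format_emotion_vector(
        {"Happiness": 0.5, "Sadness": 0.2, "Surprise": 0.3, "Anger": 0.1}
    )
    print(vector)  # 0.5,0.2,0.0,0.0,0.3,0.1,0.0,0.0
```

Checking the string locally before submitting avoids silently getting the default emotion when a value is mistyped.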

number
(minimum: 5, maximum: 30)

Speaking rate in phonemes per second. Default is 15.0. 10-12 is slow and clear, 15-17 is natural conversational, 20+ is fast. Values above 30 may produce artifacts.

Default: 15
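As a rough illustration of the speaking-rate guidance above, the following Python sketch clamps a requested rate to the form's stated limits and labels it with the bands from the description. The function names are hypothetical, and the "unlabeled" category is an assumption for values the description does not characterize (for example 13 or 18).

```python
# Limits and default taken from the input form above.
MIN_RATE = 5.0
MAX_RATE = 30.0
DEFAULT_RATE = 15.0


def clamp_speaking_rate(rate: float) -> float:
    """Force a requested rate into the accepted range (hypothetical helper)."""
    return max(MIN_RATE, min(MAX_RATE, float(rate)))


def describe_rate(rate: float) -> str:
    """Map a rate in phonemes per second to the bands named in the description."""
    if 10 <= rate <= 12:
        return "slow and clear"
    if 15 <= rate <= 17:
        return "natural conversational"
    if rate >= 20:
        return "fast"
    return "unlabeled"  # assumption: the description gives no label for these values


if __name__ == "__main__":
    for requested in (2, 11, DEFAULT_RATE, 42):
        rate = clamp_speaking_rate(requested)
        print(requested, "->", rate, describe_rate(rate))
```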

integer

Seed for reproducibility (optional)

Output
